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Matthew  Kabrisky,  professors  of  Electrical  Engineering, 

Air  Force  Institute  of  Technology.  This  research  used 
Seelandt 's  phonemes  as  a  basis  for  determining  an  optimum 
length  of  the  phonemes.  In  addition,  speech  synthesis  was 
attempted  using  single  vector  or  time  slice  phonemes. 

I  would  like  to  thank  my  advisors  Maj.  Larry  Kizer 
and  Dr.  Matthew  Kabrisky  for  their  patience  and  guidance  dur 
ing  this  project. 
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help  as  a  partner  during  my  work.  She  has  given  me  moral 
support  during  my  research  and  without  her  contributions 
this  project  could  never  have  been  completed. 
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Abstract 


An  analysis  of  speech  is  made  by  comparing  single, 
two  and  three  time  slice  phonemes  to  five  time  slice 
phonemes  (.08  sec/time  slice).  All  phonemes  were  created 
from  the  same  speech  with  the  single,  two  and  three  time 
slice  phonemes  created  from  portions  of  the  five  time 
slice  phonemes.  It  was  found  that  the  single  time  slice 
phonemes  compared  favorably  with  the  five  time  slice 
phonemes  in  recognizing  speech  only  if  the  frequency  com¬ 
ponents  remained  relatively  constant  over  a  period  of  time, 
such  as  the  vowel  and  nasal  sounds.  The  two  time  slice 
phonemes  showed  results  that  began  to  duplicate  those  of 
the  five  time  slice  phonemes,  but  still  had  inconsistent 
results  identifying  fricative  sounds.  Three  time  slice 
phonemes  results  showed  a  closer  correlation  with  the  re¬ 
sults  of  the  five  time  slice  phonemes  than  those  of  the 
one  and  two  time  slice  phonemes.  All  results  were  obtained 
using  a  64  point  sampled,  Hamming  windowed,  Discrete  Fourier 
Transform.  The  recognition  results  for  each  time  slice  of 
speech,  using  various  length  phonemes,  are  tabulated  and 
the  results  are  used  to  re-synthesize  the  original  speech. 
This  was  done  by  using  digitized  speech  composed  of  the 
middle  time  slices  from  the  71  five  time  slice  phonemes. 


vn 


Results  indicated  that  the  synthesized  speech  was  understand¬ 
able  when  the  recognition  results  successfully  identified 
the  proper  phoneme  for  approximately  4  consecutive  time  slices 
An  extraneous  phoneme  choice  in  a  consecutive  grouping  of  a 
phoneme  choice  did  not  seriously  degrade  the  output  since  it 
accounted  for  only  an  .08  second  time  slice. 


PERFORMANCE  IMPROVEMENTS  OF 
THE  PHONEME  RECOGNITION  ALGORITHM 

I.  Introduction 

Speech  recognition  has  been  studied  since  the  1940's, 
where  experiments  were  made  with  visual  hearing.  Sub¬ 
jects  in  this  experiment  studied  spectrograms  and  inter¬ 
preted  what  had  been  said  (Ref  1)  . 

In  the  1950 *s  automatic  recognition  was  introduced 
in  a  paper  that  described  a  method  of  recognizing  the 
utterance  of  digits  (Ref  1:44-45). 

Justification 

The  need  for  automatic  speech  recognition  stems  from 
the  fact  that  as  the  technology  advances  in  information 
systems,  a  more  direct  interface  is  needed  between  man 
and  machine.  For  military  applications,  this  fact  has  come 
to  the  attention  of  engineers  designing  information  systems 
for  future  aircraft.  In  particular,  this  problem  is  being 
investigated  for  the  AFTI  F-16  aircraft.  In  a  combat  en¬ 
vironment  such  as  a  night  attack  mission,  the  pilot  is  re¬ 
quired  to  monitor  and  operate  navigation,  terrain  following 
sensors, and  weapons  delivery  systems,  in  addition  to 
monitoring  the  normal  flight  instruments .  He's  also  required 
to  make  decisions  concerning  weapons  delivery  tactics, 
threat  avoidance,  and  navigate  to  and  from  target  areas. 

With  such  an  increase  in  workload,  flight  safety  and  mission 
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integrity  are  severely  jeopardized  utilizing  present  tech¬ 
nology  for  the  man-machine  interface. 


Utilizing  both  automatic  speech  recognition  and  speech 
synthesizing  can  further  the  development  of  narrow  band¬ 
width  voice  communication.  This  can  prove  valuable  where 
low  frequency  and  very  low  frequency  communications  are 
used  along  with  their  characteristically  narrow  bandwidths. 
Background 

At  the  present  time,  there  are  commercially  available 
speech  recognition  machines  that  perform  limited  speech 
recognition.  There  are  still  major  drawbacks  to  these 
available  machines. 

First,  most  of  these  machines  still  have  to  go  through 
a  training  session  where  a  speaker  recites  the  words  the 
machine  is  to  recognize.  Even  though  the  training  period 
has  been  made  fairly  painless  to  the  user,  it  still  severely 
limits  the  vocabulary  the  machine  can  understand. 

Second,  because  of  the  training  required  by  the  machines 
they  still  are  largely  speaker  dependent  either  for  an  in¬ 
dividual  or  a  small  group  of  people.  Drastic  changes  in  a 
speaker's  inflection  could  also  degrade  the  recognition 
results  even  though  the  machine  was  trained  by  that  in¬ 
dividual. 

Finally,  recognition  has  been  targeted  mainly  at  the 
recognition  of  the  utterance  of  single  words.  Problems 
with  recognizing  connected  speech  must  also  be  addressed. 

This  can  prove  to  be  more  difficult  because  the  utterance 


of  an  individual  word  is  distorted  by  the  proximity  of 
other  words  (Ref  2:4). 

Seelandt,  in  his  thesis  (Ref  2) ,  approached  the  pro¬ 
blem  of  recognizing  connected  speech.  A  phoneme  recogni¬ 
tion  scheme  was  used  with  a  set  of  71  different  phoneme 
templates  used  as  the  basis  of  his  recognition  routine. 
Phonemes  were  selected  from  connected  speech  that  has  been 
digitized  by  sampling  the  analog  speech  at  8000  HZ.  The 
speech  consisted  of  a  series  of  utterances  of  the  digits 
zero  through  nine.  The  selected  phonemes  represented  40 
msec  of  speech  or  5  time  slices  where,  each  time  slice  is 
8  msec.  The  phonemes  were  chosen  from  Seelandt' s  own  speech 
which  consisted  of  four  separate  speechfiles.  Phonemes 
were  chosen  with  the  aid  of  a  software  tool  that  allowed 
the  user  to  observe,  on  the  Tektronix  4010  graphic  terminal, 
the  spectrogram  of  20  blocks  or  800  msec  portions  of  a 
speechfile  and  be  able  to  simultaneously  listen  to  any 
portion  of  the  20  blocks  chosen.  A  block  of  speech  can 
also  be  described  as  a  group  of  4  vectors.  In  this  research 
project,  the  term  vector  and  time  slice  are  used  inter- 
changably  with  each  vector  containing  32  frequency  com¬ 
ponents  ranging  from  D.  C.  to  4000  HZ. 

Method 

As  in  Seelandt' s  thesis,  (Ref  2)  the  phoneme  recog¬ 
nition  scheme  will  use  as  a  basis  the  71  phonemes  from 
Seelandt 's  speechfile  .  The  recognition  routines  that 
are  used  were  developed  by  Seelandt  (Ref  2)  and  modified 


by  Montgomery  (Ref  31. 

The  basis  for  this  recognition  routine  can  be  found 
in  Potter,  Kopp  and  Green's  research  where  they  demon¬ 
strated  that  after  an  education  process,  humans  can  recog¬ 
nize  phonetic  patterns  in  speech  spectrograms  (Ref  11 . 
Using  this  as  justification  the  premise  is  made  here  that 
a  machine  should  be  able  to  use  the  spectrogram  data  to 
recognize  speech. 

The  problem  to  be  addressed  in  this  thesis  is  how 
much  information  in  the  phoneme  prototype  is  needed  to 
get  the  results  Seelandt  obtained  with  his  prototype 
and  how  much  information  is  needed  to  synthesize  intel- 
ligable  speech  from  the  recognition  routines.  Information 
as  used  here,  refers  to  the  number  of  time  slices  used  in 
the  phoneme  prototypes. 

Scope 

In  Seelandt' s  thesis,  the  length  of  the  phoneme  pro¬ 
totypes  was  rather  arbitrarily  chosen  to  be  5  vectors. 

The  purpose  of  this  project  was  to  determine  if  this  was 
indeed  an  optimum  length  for  the  prototype  and  what,  if 
any,  were  the  performance  differences  of  prototypes  made 
up  of  fewer  vectors.  Information  and  software  tools 
from  Seelandt* s  thesis  were  used  extensively  since  this 
was  to  be  a  follow-on  study  using  his  results. 


Because  there  is  no  perfect  model  to  compare  re¬ 
cognition  results  with,  the  5  vector  prototypes  used 


by  Seelandt  would  be  used  as  a  data  base  for  comparison 
Other  parameters  were  also  to  be  kept  constant  through 
out  the  project.  These  included  using  6dB  preemphasis 
beginning  at  500  Hz,  using  the  threshold  value  of  10 
(See  Seelandt,  Ref  2]  for  the  RMS  energy  in  a  speechfile 
and  using  a  Hamming  window  in  the  computation  of  the  spec¬ 
trograms  . 

As  shown  in  Seelandt* s  thesis  (Ref  2),  there  was  no 
performance  advantage  when  a  Hamming  window  was  used 
instead  of  a  rectangular  window.  With  this  in  mind,  the 
Hamming  window  was  used  only  for  the  sake  of  consistency . 
Sequence  of  Presentation 

The  sequence  to  be  followed  in  this  report  will  be 
to  first  discuss  the  phoneme  template  development;  secondly 
how  qualitative  analysis  was  accomplished,  the  next  section 
will  follow  with  how  speech  synthesis  was  accomplished, 
the  fourth  major  topic  will  be  how  Martin's  speech  recog¬ 
nition  routine  was  used  in  this  project  and  last,  the  re¬ 
sults  and  recommendations  will  be  addressed. 


Previous  work  by  Felkey  (Ref  4)  and  Seelandt  (Ref  2) 
led  to  the  development  of  some  of  the  software  tools  that 
were  used  in  this  effort. 

PICTALK  allows  the  user  to  input  a  file  of  processed 
speech  through  the  Eclipse  background  and  have  it  print 
a  spectrogram  on  the  Tektronix  4010  terminal  that  inter¬ 
faces  with  the  Eclipse  foreground.  Once  the  spectrogram 
has  been  printed,  the  program  allows  the  user  to  select 
and  listen  to  any  portion  of  that  speech file.  In  order 
to  output  speech  through  the  Crown  amplifier,  MONITR 
must  be  running  simultaneously  on  the  Nova.  MONITR  mon¬ 
itors  the  Interprocessor  Buffer  (IPB)  and  passes  parameters 
from  PICTALK  that  is  used  by  the  Channel  Operating  Sus- 
tem  (CHOPS) . 

PHOGEN  is  used  to  generate  the  prototype  set  of  * 

phonemes.  These  prototypes  are  5  time  slices  long  and 
the  user  has  the  option  to  either  create  a  new  set  of 
prototypes  or  create  an  averaged  set  of  prototypes.  The 
source  for  the  prototypes  are  the  speech  file.  When 
averaging  is  done,  the  following  formula  is  used: 


zOL} 


(x(i)»n)  +  y  (i) 
n  +  1 
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where:  n  is  the  number  of  times  averaging  has  occured 

y(i)  is  the  ith  component  of  a  time  slice 
to  be  averaged  in. 

X (i)  is  the  ith  component  of  a  time  slice 
from  the  existing  prototype 

z(i)  is  the  ith  component  of  a  time  slice 
from  the  new  prototype  (Ref  2) 

Problems  encountered 

In  general,  problems  were  encountered  when  three  of 
Seelandt's  programs  were  compiled  and  run. 

When  PICTALK  and  MONITR  were  run,  a  Nova  error  with 
value  163  occurred  when  an  attempt  was  made  to  listen 
to  a  portion  of  speech.  In  order  to  try  and  pinpoint 
what  caused  the  error,  flags  were  used  and  values  passed 
to  subroutines  were  echoed.  By  observing  the  flags  and 
backtracking  through  the  code  by  hand,  it  was  found  that 
the  error  originated  from  the  subroutine  CHANNEL.  CHANNEL 
is  a  Fortran  subroutine  designed  to  interface  programs 
running  on  the  Nova  to  the  Cromemeo  Z-2  computer  with 
the  channel  operating  system  loaded  and  running  in  it. 

Studying  the  documentation  on  CHANNEL  revealed  that 
the  Nova  error  was  equivalent  to  channel  error  35  which 
meant  that  the  file  to  be  transferred  was  not  evenly 
divisable  by  the  channel  block  count.  This  was  very 
puzzling  since  the  number  of  blocks  in  the  file  to  be 
transferred  was  set  as  a  constant  of  86  and  the  channel 
block  count  was  a  constant  of  2.  The  only  explanation 


for  the  error  was  that  the  file  used  was  actually  less 
than  86  blocks  and  if  this  occurs ,  the  task  is  aborted. 

This  was  corrected  by  making  the  blocks  transferred  a 
user  defined  variable  instead  of  a  constant. 

Another  problem  encountered  using  PICTALK  and  MONITR 
was  the  fact  that  it  needed  two  terminals  on  the  Eclipse 
and  one  terminal  on  the  Nova  in  order  to  operate.  Initial¬ 
ly,  PICTALK  was  modified  so  that  only  one  Eclipse  terminal 
was  needed  and  the  modified  program  was  renamed  TEKTALK. 
Later,  a  second  modification  was  made  after  the  A/D/A 
converter  for  the  Eclipse  was  operating.  With  this  modi¬ 
fication  all  the  operations  carred  out  by  the  two  computers 
were  consolidated  into  TEKTALK  and  a  swapped  program, 

TALK.  This  allowed  the  Eclipse  to  perform  the  entire  oper¬ 
ation  of  displaying  the  spectrogram  and  output  the  desired 
portion*  of  the  speechfile  through  the  Crown  amplifiers 
and  speakers  in  the  same  manner  as  PICTALK  and  MONITR 
(Ref  5) .  This  also  allowed  the  entire  operation  to  be 
executed  on  one  terminal.  Figure  1  illustrates  the 
configuration  for  operating  the  Eclipse  A/D/A  converter 


with  TEKTALK. 


frequency  generator 
8 KHz  TTL  pulse 


clock 

input 


Figure  1,  Eclipse  A/D/A  system 


Source  of  Phoneme  Template 


As  a  baseline/  Seelandt's  original  set  of  prototypes 
was  used.  All  other  prototypes  used  in  this  study  were 
obtained  by  taking  some  combination  of  vectors  from  the 
original  set.  The  reasoning  behind  this  was  to  use  the 
original  prototype  set  as  a  basis  for  comparison.  Be¬ 
cause  of  the  subjective  nature  Seelandt  used  to  pick  his 
phoneme  prototypes,  another  set  of  prototypes  could  have 
been  chosen  using  different  speechfiles  and/or  using 
different  time  slices  or  vectors.  It  was  decided  at  the 
onset  that  this  would  consume  too  much  time  and  that 
the  original  set  would  serve  adequately  as  a  baseline. 

Originally,  an  indirect  method  was  used  to  generate 
the  new  prototype  sets.  Seelandt  has  developed  a  speech- 
file  made  up  of  just  his  71  phonemes  call  "PHONEMES". 

This  speechfile  was  used  in  speech  synthesis  (Ref  2) . 
Running  this  file  through  TEKTALK  created  the  frequency 
component  file  and  allowed  visual  inspection  of  the 
speech.  PHOGEN  was  then  used  for  creating  the  phoneme 
templates  by  allowing  desired  vectors  to  be  chosen  from 
the  frequency  component  file. 

While  analyzing  the  spectrogram  of  PHONEMES  it 
was  found  there  were  slight  differences  between  the 
spectral  components  of  the  speechfile  and  those  of 
Seelandt's  prototypes  (Ref  2).  It  was  then  decided  that 
since  the  original  objective  was  to  use  Seelandt's 


prototypes  as  a  data  base,  that  the  speechfiles  used  to 
create  Seelandt's  prototypes  would  also  be  used  to  create 
the  new  prototypes.  Table  I  shows  the  speechfile  used 
and  the  5  vector  phoneme  picked  from  them.  Table  II 
lists  the  phoneme  prototype  number  and  their  associated 
vector  numbers  from  the  speechfiles.  The  frequency  com¬ 
ponent  files  of  these  speechfiles  were  created  by  SGRAM 
and  were  then  used  by  PHOGEN. 


Table  I 


Source  Speechfiles  used  and 
their  associated  phoneme  templates 


Speech  File 
FSP1B.KS 


I  FSP4A.KS 

I 


! 

I 

1 

| 


FSP2A.KS 

FSP3A.KS 


phoneme  numbers 

1  through  24 

25  through  30 
57  through  62 

31  through  56 

63  through  71 


■i 

i 


\ 


L2 


Table  XI 


1 

Five  Vector 

associated 

Phonemes  and  the  vector  numbers  they 

with  in  the  original  speechfiles. 

are 

1 

Beginning 

Beginning 

1 

phoneme 

Vector 

Phoneme 

Vector 

number 

number 

number 

number 

1 

1 

21 

201 

1 

2 

9 

22 

211 

1 

3 

15 

23 

221 

4 

27 

24 

230 

i 

5 

34 

25 

87 

1 

6 

41 

26 

96 

7 

50 

27 

102 

■  ■  <*i 

8 

56 

28 

108 

i  C« 

9 

61 

29 

118 

IQ 

67 

30 

125 

| 

11 

72 

31 

25 

■ 

12 

104 

32 

32 

13 

113 

33 

47 

i 

14 

118 

34 

61 

15 

127 

35 

69 

16 

134 

36 

81 

i 

17 

139 

37 

87 

18 

148 

38 

96 

19 

190 

39 

104 

1 

20 

195 

40 

113 

l 
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Single  Vector  Phoneme  -  Template  Selection 


The  first  prototype  set  created  was  the  single 
vector  phonemes.  These  were  created  by  taking  the  middle 
vector  from  the  original  5  vector  phoneme  template  and 
using  it  as  a  template  for  that  phoneme.  The  middle  vec¬ 
tor  was  chosen  because  in  a  five  vector  phoneme,  the  dif¬ 
ference  between  the  middle  vector's  frequency  components 
and  those  of  the  vector's  farthest  from  it  are  likely  to 
be  less  than  the  first  vector's  frequency  components  com¬ 
pared  to  the  fifth  vector's  component.  Because  the  spectral 
components  differ  greatly  from  phoneme  to  phoneme,  this 
is  not  always  true.  There  are  some  phonemes  that  have 
spectral  components  that  don't  change  very  much  as  the  time 
slice  changes,  an  example  of  this  is  the  "n"  sound  in 
"one".  In  this  case,  any  one  of  the  five  could  perform 
just  as  well  as  a  phoneme  template. 

Two  Vector  Phoneme  Template  Selection 

Initially,  two  sets  of  templates  were  created  to 
compare  with  each  other  and  the  other  template  sets. 

The  first  set  consisted  of  templates  created  by  using 
the  second  and  middle  vector  from  the  five  vector  tem¬ 
plates.  The  second  set  was  created  from  the  first  and 
second  vector  of  the  five  vector  templates.  The  com¬ 
parison  testing  consisted  of  using  each  template  set  as 
the  phoneme  prototypes  in  PHDIST  and  running  the  recog¬ 
nition  routines  against  the  speechfiles  the  original 


phoneme  prototypes  were  taken  from.  As  expected,  it 
showed  that  the  majority  of  the  phonemes  performed  al¬ 
most  identically  on  the  given  speechfiles.  There  were 
isolated  cases  where  one  phoneme  of  a  given  set  performed 
better  than  the  corresponding  phoneme  of  the  other  set. 

The  method  used  in  the  comparison  will  be  covered  in  chap-: 
ter  III.  With  these  results,  a  third  set  of  phoneme  tem¬ 
plates  was  developed  using  the  phoneme  templates  from  the 
first  two  sets.  If  the  two  phonemes  had  identical  perfor¬ 
mance,  it  was  just  an  arbitrary  choice  as  to  which  one  was 
used.  If  one  showed  an  improvement  in  performance  over 
the  other  it  was  chosen  as  the  phoneme  template  for  the 
third  set.  What  resulted  was  a  hybrid  set  of  phoneme 
templates  that  contained  the  best  performers  of  the  two 
sets  of  phoneme  templates,  and  thus  would  have  an  op¬ 
timized  performance  for  two  vector  phoneme  prototypes. 
Since  the  locations  of  the  vectors  used  as  prototype 
phoneme  templates  were  now  know,  a  set  of  phoneme  tem¬ 
plates  could  be  constructed  with  relative  ease  and  in  a 
short  period  of  time.  With  this  in  mind  and  for  the  sake 
of  completeness,  two  more  sets  of  two  vector  phoneme  tem¬ 
plates  were  constructed  for  comparison  testing  against  the 
third  set  of  phoneme  templates.  The  fourth  set  was  made 
up  of  phonemes  containing  the  middle  and  fourth  vectors 
and  the  fifth  set  was  made  up  of  phonemes  containing  the 
fourth  and  fifth  vectors  of  each  original  five  vector 


phoneme  template.  The  same  type  of  comparison  testing  was 
performed  using  these  last  three  sets  of  phoneme  templates. 
As  in  the  first  set  of  tests, the  best  performers  from  the 
three  sets  were  combined  into  a  sixth  and  final  set.  If 
there  were  no  clearly  superior  performer  for  a  given  phoneme 
from  the  three  sets,  the  template  from  the  third  set  was 
arbitrarily  chosen  to  be  included  in  the  sixth  set.  The 
reason  for  this  judgement  call  was  strictly  for  convenience. 
The  program  used  to  create  the  phoneme  template  set  is 
constructed  to  either  create  a  totally  different  set  of 
templates  or  it  can  modify  an  existing  set  of  templates, 
thus  saving  time  and  eleminating  a  lot  of  tedious  work 
that  increases  the  chances  for  an  error  to  occur.  This 
sixth  and  final  set  of  phoneme  templates  is  the  set  that 
was  used  in  the  comparison  test  between  the  various  length 
phoneme  templates.  Table  III  shows  a  listing  of  phonemes 
in  this  final  set,  labelled  set  F,  and  the  orginating  tem¬ 
plate  set  for  each  phoneme.  Set  A  is  the  first  set,  B 
is  the  second  set,  C  is  the  third  set,  D  is  the  fourth 
set  and  E  is  the  fifth  set. 


Table  III 


Final  Two  Vector  Phoneme  Templates 
and  Template  Set  They  Originated  From 


Vector 

number 

Originating 

set 

Vector 

number 

Originating 

set 

1 

C 

20 

A 

2 

E 

21 

A 

3 

A 

22 

E 

4 

D 

23 

E 

5 

A 

24 

E 

6 

D 

25 

B 

7 

E 

26 

D 

8 

E 

27 

E 

9 

E 

28 

A 

10 

E 

29 

B 

11 

B 

30 

B 

12 

A 

31 

B 

13 

A 

32 

B 

14 

A 

33 

B 

15 

A 

34 

B 

16 

A 

35 

D 

17 

A 

36 

A 

18 

A 

37 

E 

19 

A 

38 

A 

18 


Table  III  Cont 


i  «• 


i 


i 


19 


Three  Vector  Phoneme  Template  Selection 


!• 


For  the  comparison  testing/  a  phoneme  template 
set  consisting  of  the  middle  three  vectors  from  the 
original  phonemes  was  created.  The  set  was  created 
using  the  same  procedure  used  to  create  the  one  and  the 
two  vector  phoneme  template  sets.  Only  one  set  of  three 
vector  phonemes  was  created.  The  reasoning  behind  creat¬ 
ing  only  one  set  was  due  to  the  tremendous  amount  of  over¬ 
lapping  that  would  occur  in  the  creation  of  different  sets 
of  templates.  What  this  means  is  that  if  another  set 
of  phonemes  was  created,  only  one  vector  per  phoneme 
would  be  different  from  the  first  set  of  three  vector 
phonemes.  This  single  vector  difference  amounts  to  only 
8  msec  of  speech.  This  overlapping  would  be  expected  to 
produce  the  same  results  in  the  recognition  routines  when 
compared  to  the  set  of  three  vector  phoneme  templates  that 
were  used  in  this  project. 

Verification  of  Phoneme  Template  Sets 

A  verification  process  was  initiated  after  the  selec¬ 
tion  of  each  set  of  phoneme  templates .  This  process  was 
necessary  to  insure  that  the  vectors  used  in  the  set  of 
templates  match  the  desired  vectors  in  the  original  tem¬ 
plate  set.  The  verification  was  done  by  visual  inspection 
of  the  spectrograms.  Because  the  phoneme  template  sets 
were  actually  record  files  containing  the  Discrete  Fourier 
Transformed  frequency  components  of  speech,  a  modified 


20 


spectrogram  program  had  to  be  used  to  display  the  spectro¬ 
gram  of  each  phoneme  template  set.  This  program  was  SGRAM 
4  developed  by  Seelandt  to  be  used  specifically  for  this 
purpose.  Once  the  spectrograms  were  obtained, a  vector  by 
vector  comparison  was  done  between  the  phoneme  template 
set  and  the  original  five  vector  phoneme  template  set. 

This  verified  that  wrong  vectors  were  not  inadvertently 
chosen  and  that  corrections  made  to  the  template  sets  were 


indeed  made. 


III.  Qualitative  Analysis  off  Phoneme  Template  Sets 


Method 

Once  the  different  template  sets  were  formed,  a  method 
for  comparing  them  had  to  be  devised.  This  involved  the 
use  of  software  tools  and  visual  inspection.  The  one 
major  problem  to  overcome  was  what  to  use  as  a  model  for 
comparison  and  what  parameters  were  to  be  used  in  the  com¬ 
parison.  Once  this  problem  was  resolved,  software  could 
be  written  to  suit  the  needs  dictated  by  the  comparison 
tests. 

As  mentioned  earlier,  the  intent  of  this  project  was 
to  find  out  if  five  vector  phoneme  templates  were  indeed  the 
best  choice  for  prototypes.  Based  on  this  premise,  the 
five  vector  phoneme  template  set  was  chosen  as  the  basis 
for  comparison.  Accordingly,  the  speech  that  was  used  for 
forming  the  phoneme  template  set  was  also  used  as  the  stand¬ 
ard  for  exercising  the  different  phoneme  template  sets. 

This  procedure  was  chosen  because,  with  speech,  there  can 
be  no  standard  universal  model  for  comparison.  It  was 
felt  that  using  the  five  vector  phoneme  templates  in  the 
recognition  routines  would  produce  the  optimum  results. 

These  results  then  served  as  the  standard  model  in  the 
comparisons. 

Software  Tools 
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With  the  basic  ground  work  laid  for  the  comparison 
testing,  programs  could  be  developed  and  utilized  to 
provide  the  results  needed  to  perform  the  final  analysis 
of  the  comparison  testing.  Initially,  the  phoneme  re¬ 
cognition  routines  used  were  Seelandt's  TRYDIST  5  and  LIS¬ 
TER  4  programs.  TRYDIST  5  calculated  the  distances  between 
the  desired  phoneme  template  set  and  the  chosen  speechfile. 
Each  phoneme  is  compared  to  a  segment  of  speech  equal  to 
the  length  of  the  template.  The  speechfile  has  been  pro¬ 
cessed  by  a  Descrete  Fourier  Transform  into  8  msec  vectors 
in  the  same  format  as  the  templates.  In  this  way  TRYDIST  5 
then  increments  the  segment  of  speech  one  vector  at  a 
time  and  performs  another  distance  measure  against  the 
given  phoneme  template.  In  this  manner,  each  phoneme  is 
compared  against  the  entire  speechfile;  giving  a  distance 
score  at  each  vector  excluding  the  last  N  vectors  where  N 
is  the  number  of  vectors  in  the  phoneme  prototypes.  LIS¬ 
TER  4  then  takes  the  results  from  TRYDIST  5  and  outputs 
the  five  closest  matches  according  to  their  distances  for 
each  vector.  It  also  outputs  a  range  factor  and  scale 
factor  for  each  vector,  but  these  were  not  utilized  in 
this  project  (Ref  21 . 

Problems  were  encountered  when  these  programs  were 
first  used.  In  order  to  allow  the  program  to  work  with 
various  length  phoneme  templates,  TRYDIST  5  had  some  op¬ 
tional  statements  that  needed  to  be  compiled.  When  an 


attempt  was  made  to  compile  this  program,  compile  errors 
occurred.  By  troubleshooting  the  program,  the  error  was 
found  to  be  two  illegal  statements  that  were  used  only 
when  the  compile  option  was  chosen.  These  referred  to  a 
specific  file  that  was  not  used  for  any  purpose  in  the 
program.  With  all  statements  referring  to  this  file 
eliminated,  the  program  compiled  successfully. 

After  working  with  TRYDIST  5  and  LISTER  4,  another 
problem  surfaced  that  lead  to  the  elimination  of  these 
programs  as  a  primary  means  of  performing  the  recognition 
routines.  This  problem  was  in  the  output  of  LISTER  4 
where  the  top  five  choices  were  printed  for  each  vector. 
The  output  routine  listed  the  top  choices  by  the  phoneme 
name  (i.e.  ZX,  AH,  IY,  etc.l.  This  proved  to  be  unsatis¬ 
factory  for  this  project  since  the  same  name  was  given  to 
more  than  one  phoneme  template  in  many  cases.  An  example 
of  this  is  the  "T"  sound  in  both  J'two"  and  "eight".  "Two" 
has  three  phonemes  that  were  named  "Tx".  Fortunantly, 
Montgomery  also  realized  this  and  modified  TRYDIST  5  to 
create  PHDIST  and  modified  LISTER  4  to  create  CHOICE  5. 
These  programs  perform  the  same  function  as  Seelandt's 
program,  but  the  output  would  be  the  top  phoneme  template 
choice  for  each  vector  as  a  phoneme  number.  This  made  it 
easy  to  see  exactly  which  phoneme  was  chosen  (Ref  31 . 

As  good  as  the  output  of  CHOICE  5  was  for  giving  an 
indication  of  the  phoneme  templates'  performance,  it  still 


was  not  in  a  desirable  form  for  documenting  the  results 
of  the  comparison.  A  more  qualitative  and  visual  indi¬ 
cation  of  the  test  was  needed  as  documentation.  It  was 
felt  that  the  best  method  for  meeting  these  parameters 
was  using  graphical  representations  of  the  results. 

This  task  was  performed  by  the  program,  PLOTPHON.  In 
this  program  the  results  from  CHOICE  5  were  utilized. 

The  user  chooses  the  phoneme  to  be  examined  and  the  pro¬ 
gram  searches  the  results  from  CHOICE  5,  looking  for  where 
this  chosen  phoneme  shows  up.  If  the  phoneme  appears  more 
than  twice  it  then  proceeds  to  the  next  step  of  the  pro¬ 
gram,  otherwise  it  informs  the  user  that  the  phoneme  ap¬ 
pears  less  than  twice  and  ends  the  program.  The  next 
step  involves  modifying  the  phoneme's  distance  scores  to 
lie  in  an  amplitude  value  between  one  and  zero.  This  is 
done  by  using  the  following  equation: 


Np  -  Minin 
P  Mmax  -  Mmin 


Ap  -  Amplitude  of  distance  score  for  vector  p. 

Np  -  Phoneme  distance  score  for  vector  p. 

Mmin  -  Minimum  distance  score  in  speechfile  for  given 
phoneme . 


Mmax  -  Maximum  distance  score  in  speechfile  for 
given  phoneme. 
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Once  the  computation  has  been  performed ,  the  amplitudes 
are  then  plotted  on  a  graph  versus  their  corresponding 
vector  numbers  from  the  speechfile. 

A  modification  of  this  program  also  allows  the  user 
to  construct  a  window  illustrating  where,  in  a  speechfile, 
a  phoneme  was  taken  from.  This  was  useful  in  showing 
where  a  given  phoneme  was  expected  to  appear  when  the 
original  speechfile  was  used  in  the  recognition  routine. 
With  a  graph  for  a  given  phoneme  number  and  from  each  set 
of  templates,  the  results  can  now  be  easily  compared  and 
with  a  hard  copy  of  the  graph,  the  results  are  also 
documented.  Examples  of  the  resultant  graphs  are  shown  in 
Figure  2  through  Figure  5.  A  more  extensive  documen¬ 
tation  of  the  results  is  in  Appendix  B  . 


? PHONEME  AMPL 


2 PHONEME  AMPL. 


? PHONEME  AMPL. 


AMPLITUDES  FOR  PHONEME  #  38  FROM  FILE  3UFSP4A.0i 
WORDS  SPOKEN  AND  SPEAKER*  8-3  BY  SEELANDT 
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IV.  Speech  Synthesis 


As  results  from  the  recognition  routines  were  being 
obtained,  it  was  suggested  that  a  way  be  devised  to  output 
the  results  as  speech.  This  would  give  an  audible  in¬ 
dication  of  how  the  recognition  routine  performed.  It 
also  would  serve  as  another  metric  for  measuring  the  per¬ 
formance  of  the  phoneme  template  sets.  Albeit,  it  is  a 
very  subjective  metric  because  it  is  based  on  what  is  per¬ 
ceived  through  each  individual's  auditory  senses. 

Method 

In  human  speech,  an  utterance  consists  of  a  combination 
of  sounds  from  a  basic  set  of  sounds  that  are  called  phonemes. 
For  synthesis  the  same  principle  is  used.  The  computer 
has  stored  in  memory  a  record  file  of  the  basic  sounds 
needed  to  produce  an  utterance.  For  this  project,  the 
question  was  raised  as  to  how  much  information  is  needed  in 
each  phoneme  to  produce  understandable  speech.  The  best 
way  to  answer  this  question  was  to  start  at  the  smallest 
number  of  vectors  per  phoneme  and  build  upon  this  basic 
building  block  until  understandable  speech  has  been  a- 
cheived.  Therefore,  the  first  file  of  phonemes  used  in 
synthesized  speech  was  created  as  a  record  file  of  digitized 
speech  using  only  the  middle  vector  from  each  of  the  71 
five  vector  phonemes. 


Software  Tools 


The  tool  used  to  create  this  file  was  a  program  called 
SPEECH.  This  program  allowed  the  user  to  select  the  de¬ 
sired  vectors  in  a  speechfile  and  put  them  in  another  file 
for  use  with  synthesized  speech.  A  spectrogram  of  the 
speechfile  of  single  vector  phonemes  is  illustrated  in 
Figure  6. 

After  the  file  of  phonemes  was  created,  a  coherence 
test  was  devised  to  compare  the  single  vector  phonemes 
to  the  five  vector  phonemes.  Before  a  direct  compari¬ 
son  could  be  made,  the  single  vector  phoneme  set  had  to 
be  modified.  This  was  due  to  the  fact  that  each  phoneme 
represented  only  8  msec  of  speech  and  if  the  speechfile 
was  to  be  an  output  of  speech  it  would  be  unintelligable. 
What  was  needed  was  a  procedure  to  lengthen  the  speechfile 
by  repeating  each  phoneme  a  suitable  number  of  times  so 
that  the  length  of  the  modified  phoneme  speechfile  cor¬ 
responded  to  that  of  the  five  vector  phoneme  speechfile. 

The  creation  of  this  file  was  done  in  a  two  step  process. 
The  first  step  involved  utilizing  one  of  Seelandt's 
programs  call  TYPETALK.  This  program  created  a  file  of 
vector  numbers  corresponding  to  those  found  in  a  speechfile 
and  a  phoneme  number  that  was  repeated  five  times.  An  ex¬ 
ample  of  the  format  for  this  file  is  as  follows:  Vector 
one  would  be  followed  by  phoneme  one,  as  would  vector  two 
through  five;  Vector  six  would  be  followed  by  phoneme  two. 
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Figure  6 .  Single  vector  phoneme  speechf ile 
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as  would  vector  seven  through  ten;  and  the  format  would 
continue  through  the  seventy-first  phoneme. 

Upon  creation  of  this  file,  program  TALK  2  would  be 
used  to  create  the  speechfile.  This  is  also  the  program 
that  utilizes  the  speechfile  of  single  vector  phonemes 
that  was  originally  created  by  SPEECH.  TALK  2  takes  the 
file  created  by  TYPETALK  and  reads  each  phoneme  number  in 
sequence  and  goes  to  its  phoneme  speechfile  and  selects 
the  corresponding  phoneme  from  that  file  and  puts  it  into 
a  user  designated  speechfile.  The  resulting  end  product 
is  a  speechfile  consisting  of  71  groups  where  each  group 
consisted  of  a  single  vector  phoneme  repeated  five  times. 
For  further  clarification  of  this  description,  Figure  7 
illustrates  the  resulting  speechfile  as  a  spectrogram. 

In  contrast.  Figure  8  illustrates  the  spectrogram  of  the 
speechfile  made  up  of  the  71  five  vector  phonemes.  As 
seen  in  the  figures  there  is  a  marked  difference  between 
the  two  speechfiles.  With  the  five  vector  phonemes,  the 
file  contains  enough  of  the  speaker's  characteristics  for 
a  listener  to  identify  the  speaker  even  though  a  large 
portion  of  the  original  speechfile  was  not  used  when  the 
phonemes  were  developed.  This  is  true  because  when 
Seelandt  picked  his  phonemes,  he  was  looking  for  changes 
during  an  utterance.  Therefore,  the  phoneme  speechfile 
sounds  slightly  choppy  and  the  individual  sound  (i.e. 
phoneme}  is  not  held  for  the  same  length  of  time  as  in  the 


uttered  speech.. 

When  the  modified  single  vector  phoneme  speechfile 
was  listened  to  it  was  coherent  speech,  but  had  a  very 
mechanical  sound  and  was  also  choppy.  There  was  none  of  the 
recognizable  speaker's  characteristics  as  with  the  five 
vector  phonemes.  This  was  expected  because  this  speech- 
file  contained  only  one-fifth  of  the  information  contained 
in  the  five  vector  phonemes.  However,  there  was  still 
enough  information  to  identify  what  was  said  after  each 
single  vector  phoneme  was  repeated  five  consecutive  times. 

The  results  using  the  modified  single  vector  phoneme 
speechfile  were  encouraging  enough  to  use  the  single  vec¬ 
tor  phonemes  as  the  source  for  synthesized  speech.  This 
precluded  testing  the  two  or  three  vector  phonemes  and 
made  the  algorithm  for  synthesizing  speech  easier  to  deal 
with.  The  program  that  employed  the  synthesized  speech 
algorithm  was  TALK  2.  This  algorithm  was  explained 
earlier,  but  in  this  case  TALK  2  reads  a  file  containing 
the  vector  numbers  from  a  speechfile  and  their  correr 
sponding  phonemes  chosen  in  the  recognition  routines.  In 
this  manner,  the  results  from  the  recognition  routines  be¬ 
comes  an  audible  output  that  is  fed  back  tc  the  user  so 
he  or  she  can  make  a  comparison  between  results  obtained 
using  different  phoneme  templates  and  make  the  judgement 
as  to  which  sounds  the  best.  As  mentioned  earlier,  this 
method  is  some  what  subjective,  but  did  provide  another 
means  to  compare  the  four  sets  of  templates. 
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Figure  7a.  Synthesized  five  vector  phonemes 
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msec 


Figure  7b.  Synthesized  five  vector  phonemes 
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V.  Implementation  of  Martin ' s  Recognition  Routine  (Ref  6) 


In  Martin's  thesis,  there  is  developed  a  recognition 
routine  that  was  used  with  speech  that  was  processed  us¬ 
ing  the  array  processor  CRef  6) .  There  are  three  major 
differences  between  Martin's  program  and  Seelandt's. 

The  first  difference  is  that  Martin's  program  could  only 
use  a  single  phoneme  template  set  in  the  recognition  pro¬ 
cess.  The  second  difference  is  that  Martin's  program 
could  use  the  speechfile  as  a  template  set  and  use  the 
speechfile  in  the  recognition  routine  and  run  it  against 
itself.  This  is  very  useful  in  investigating  similarities 
in  the  speechfile  where  a  vector  may  be  a  close  match  to 
another  unrelated  vector  in  the  utterance.  The  final 
difference  involved  the  method  by  which  distances  were 
computed  between  the  template  and  the  associated  block  of 
speech.  Both  used  the  general  Minkowski  metric  of  order 
s ,  where : 


i 

t  \ 

'  T  V''  1  I  *  i^$ 

aj 

U  0- 

rHi“  J  (Ref  8) 

1=1 

is  the  general 

formula . 

Seelandt  used  the  Minkowski 

distance  with  the  order  equal  to  one.  This  is  also  known 
as  the  City  Block  distance  or  Ml  distance  and,  in  his 


44 


phoneme  distance.  However ,  there  were  a  few  instances 
where  the  top  phoneme  choice  was  different.  Por  the  other 
four  choices  there  were  also  a  few  instances  where  the 
choices  were  reversed  and  also  where  a  different  choice 
altogether  showed  up.  In  all,  the  differences  were 
isolated.  Both  programs  were  checked  for  errors,  but 
none  were  found.  It  was  then  concluded  that  the  isolated 
differences  were  the  result  of  round  off  errors  in  the 
routine.  This  is  very  likely  since  both  programs  convert 
integers  to  real  numbers  and  real  numbers  to  integers  in 
various  portions  of  the  routines.  When  it  was  shown  that 
both  programs  generally  gave  the  same  results,  Martin's 
program  was  chosen  as  the  primary  software  tool  for 
speech  recognition.  This  was  due  to  the  versatility  of 
the  parameters;  Martin's  made  extensive  use  of  menues  to 
effect  changes  in  the  parameters.  Another  advantage 
Martin's  program  had  was  that  it  could  process  a  full  88 
blocks  or  352  vectors  of  speech.  This  is  compared  to 
Seelandt's  program  which  could  only  process  200  vectors 
of  speech  at  a  time.  This  characteristic  was  valuable 
when  the  results  from  the  program  were  an  output  of 
synthesized  speech.  Seelandt's  results  had  to  be  syn¬ 
thesized  as  two  speechfiles  which  added  to  the  distortion 
of  the  speech  because  of  the  cut  off  in  the  middle  of 
an  utterance.  Because  of  the  restriction  Martin's  program 
placed  on  the  template  set,  modifications  had  to  be  made 


program,  was  of  the  following  form: 


for  t  = 


Sj  =  the  jth  speech  sample 
t  =  time  shift 

N  =  Number  of  speech  samples  (Ref  2:53,54} 

In  Martin's  program,  the  user  had  the  choice  of  using  the 
Minkowski  distance  with  the  order  equal  to  one  or  two. 
With  the  order  equal  to  two,  the  formula  is  also  known 
as  the  Eulidean  distance  or  M2  distance.  The  equation 
for  the  formula  is  as  follows: 


d 


E 


(Ref  8:231-240) 


Using  both  programs  and  the  single  vector  phoneme 
template  set,  both  recognition  routines  were  run  using 
the  same  speechfile.  The  top  five  phoneme  choices  per 
vector  were  then  compared  to  note  any  discrepancy  when 
the  City  Block  distance  was  used  in  both  programs. 

Ideally,  both  outputs  should  be  identical  since  all 
parameters  in  both  programs  were  also  identical.  General¬ 
ly  the  outputs  did  track  each  other  in  choosing  the  top 


to  accommodate  the  use  of  different  length  phoneme  templates. 
Two  options  were  available;  one  was  to  modify  the  program 
and  the  other  was  to  modify  the  files  used  by  the  program. 
Because  of  time  constraints  and  possible  interface  problems 
it  was  decided  to  alter  the  speech  and  phoneme  templates' 
spectrum  files. 

In  order  to  explain  how  the  alterations  were  ac¬ 
complished  a  review  of  how  the  recognition  algorithm 
operates  for  phonemes  greater  than  one  vector  and  how 
Martin's  program  utilizes  the  file  created  for  the  tem¬ 
plates  and  speech  is  needed.  As  stated  in  Chapter  III, 
the  recognition  algorithm  operates  by  comparing  a  phoneme 
to  the  speechfile  in  a  fashion  similar  to  a  sliding  window. 
The  window  is  N  vectors  wide,  where  N  is  the  number  of  vec¬ 
tors  in  the  phoneme  template.  The  window  slides  through 
the  speechfile  one  vector  at  a  time  and  has  a  distance  value 
calculated  for  each  increment.  What  this  in  effect  does  is 
lengthen  the  speechfile  by  a  factor  of  N.  All  speechfiles 
used  by  Martin's  program  must  have  a  spectrum  of  the  speech- 
file  calculated.  A  header  block  of  256  works  is  also  added 
to  this  file.  This  header  holds  information  such  as  speech- 
file  name,  number  of  components  per  vector,  and  first  and 
last  vector  of  the  block  of  speech  to  be  used  in  Martin's 
program.  Martin's  program  inserts  this  information  from 
menu  selections  and  later  uses  this  information  to  perform 
the  recognition  routines. 


Alterations  to  the  templates'  spectrum  file  was 
limited  to  changing  values  in  the  header  block.  For  an 
N  vector  phoneme  template ,  the  number  of  frequency  com¬ 
ponents  per  vector  was  increased  by  a  factor  of  N.  There¬ 
fore  if  there  are  32  frequency  components  per  vector  and 
the  phoneme  length  was  four  vectors,  the  altered  vector 
would  have  128  frequency  components.  With  each  vector 
made  up  of  four  32  component  vectors,  the  value  for  the 
number  of  vectors  in  the  template  set  also  had  to  be  changed 
Therefore  the  original  value  in  the  header  block  was  di¬ 
vided  by  N  and  this  new  value  was  substituted  back  into  the 
header . 

Alteration  of  the  speech  spectrum  file  was  more  ex¬ 
tensive  than  that  involving  the  template  spectrum  file. 

The  value  for  the  components  per  vector  was  changed  to 
match  that  of  the  template  file.  The  value  for  the  number 
of  vectors  in  the  speechfile  was  also  changed.  In  this 
case,  a  value  that  is  one  less  than  the  number  of  vectors 
per  phoneme  is  subtracted  from  the  number  of  vectors  in 
the  unaltered  speechfile.  The  last  alteration  involved 
the  speechfile  itself  where  it  had  to  be  altered  to  conform 
to  the  recognition  algorithm  for  multiple  vector  phonemes. 

As  stated  earlier,  the  window  was  incremented  one  vector 
at  a  time.  To  duplicate  this  effect  in  Martin's  program, 
the  speechfile  had  to  be  lengthened  by  treating  the  vec¬ 
tors  in  the  window  as  a  single  vector.  In  this  manner, 


the  contents  in  the  window  as  it  increments  through  the 
file  is  sequently  stored  in  another  file  as  the  altered 
speechfile.  An  example  of  this  procedure  is  shown  in 
figure  9  for  a  four  vector  phoneme  template. 
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Figure  9.  Four  vector  phoneme  template 


The  alteration  routine  was  performed  by  a  program 
called  CHNGFILE.  As  can  be  seen,  if  the  speechfile  in¬ 
volved  was  originally  88  blocks  long  the  altered  speechfile 
would  require  an  enormous  amount  of  file  space.  Because 
of  the  size  of  the  file  it  also  takes  an  extraordinary 
amount  of  computer  time  to  perform  the  computations  in 
the  recognition  routine.  For  an  88  block  unaltered  speech- 
file  this  time  can  be  on  the  order  of  15  to  20  minutes, 
even  though  using  this  procedure  for  multiple  vector 
phonemes  precluded  most  interface  problems  with  Martin's 


program,  one  problem  did  surface.  This  problem  came  about 


when  altered  template  sets  and  speechflles  were  used  for 
three  and  five  vector  phonemes.  In  these  cases,  Martin's 
program  would  run  without  displaying  error  conditions  but 
the  output  would  show  glaring  mistakes.  These  mistakes 
typically  would  show  up  as  a  sequence  of  phoneme  choices 
that  was  repeated  throughout  the  speechfile  or  as  a  single 
top  phoneme  choice  that  would  appear  at  regular  intervals 
in  the  output.  Because  of  time  constraints,  the  problem 
could  not  be  located;  but  because  it  didn't  show  up  using 
single,  two  or  four  vector  phonemes  it  was  concluded  that 
the  problem  was  tied  to  the  number  of  frequency  components 
per  vector. 

Since  only  multiples  of  32  could  be  used  for  the  num¬ 
ber  of  frequency  components  per  vector,  a  set  of  four  vector 
phoneme  templates  was  created.  This  set  was  used  instead 
of  the  five  vector  phonemes  in  comparison  testing  using 
Martin's  programs.  Because  of  the  difference  of  only  one 
vector,  it  was  felt  that  the  results  would  still  be  valid 
in  the  test  cases.  This  assumption  was  proven  out  in  a 
comparison  of  results  using  four  vector  and  five  vector 
phoneme  templates  in  Seelandt's  program.  With  this  as¬ 
sumption  proven,  the  results  from  Martin's  program 
were  modified  by  Beachy's  program  to  present  a  more  de¬ 
scriptive  output  (Ref  7J .  This  consisted  of  a  listing 
showing  the  top  five  choices  for  each  vector  in  a  speech- 
file  in  a  format  similar  to  Seelandt's  (Ref  2).  The  out- 


put  from  Beachy's  program  was  then  modified  to  create  a 
file  of  the  results  that  were  compatable  with  TALK  2. 

The  results  from  the  four  vector  phoneme  templates,  were 
used  as  the  basis  of  the  comparison  and  established  the 
baseline. 


Results  and  Recommendations 


Initially,  the  purpose  of  the  research  project  was 
to  analyze  the  results  of  speech  recognition  using  various 
length  phoneme  templates  and  determine  which  phoneme  tem¬ 
plate  set  gave  the  best  overall  performance.  A  secondary 
effort  during  the  research  project  was  the  synthesis  of 
speech  using  single  vector  phonemes  as  the  source.  The 
synthesized  speech  was  also  used  as  a  performance  indicator 
for  the  various  length  phonemes.  In  conjunction  with  the 
development  of  synthesized  speech,  software  was  developed 
to  utilize  Martin's  program  DRVR  in  the  evaluation  of 
single,  two  and  four  vector  phoneme  templates. 

Results 

Results  from  the  recognition  routines  showed  that 
there  was  a  gradual  degradation  as  the  length  of  the  phoneme 
decreases.  Table  IV  lists  the  phonemes  for  each  template 
set  that  showed  a  degradation  in  performance  when  compared 
to  the  five  vector  phoneme  template  set.  There  was  a  de¬ 
finite  pattern  in  the  phonemes  that  tended  to  be  dependent 
on  the  phoneme  length.  These  phonemes  tended  to  be  those 
that  had  a  larger  variation  in  spectral  components  from 
vector  to  vector,  such  as  with  the  fricative  and  sibilence 
sounds.  If  the  phonemes  had  small  variaties  in  the  com¬ 
ponents  from  vector  to  vector  there  were  no  significant 


changes  in  performance  between  single  vector  or  five  vector 
phonemes.  With  the  degradation  resulting  from  the  phonemes 
in  Table  IV,  the  five  vector  phoneme  templates  had  the  best 
overall  performance. 

The  speech  synthesis  portion  of  this  project  almost 
exclusively  used  the  output  generated  by  Martin's  program. 
The  only  time  the  output  from  Seelandt's  program  was  used 
was  in  verifying  the  software  associated  with  Martin's 
program.  This  was  done  by  both  visually  checking  the  out¬ 
put  and  then  comparing  the  synthesized  speech  by  recording 
the  two  speechfiles  simultaneously  on  the  Ampex  reel-to- 
reel  recorder.  The  tape  was  then  played  back  and  by  lis¬ 
tening  to  the  recording,  a  comparison  could  be  made.  This 
same  method  was  applied  when  comparing  the  synthesized 
output  for  results  using  both  Ml  and  M2  distances  and  the 
single,  two  and  four  vector  phoneme  templates.  This  gave 
a  total  of  six  separate  synthesized  speechfiles  for  each 
speechfile  that  was  run  through  the  recognition  routine. 
Using  the  same  speechfiles  that  were  used  for  the  first 
portion  of  the  research,  the  results  from  the  four  vector 
templates  sounded  better  than  the  other  two  template  sets. 
On  the  other  hand  there  was  very  little  difference  in  the 
quality  when  comparing  the  two  distances  using  the  same 
template  set.  This  was  further  verified  by  examining  the 
printed  outputs  from  the  recognition  routine.  If  a  choice 
had  to  be  made,  Ml  had  a  marginal  advantage  over  the  M2 


also  ronfused  phonemes  9  with  10,  65  with  66 
and  57  with  58 


distance  but  not  enough,  to  make  a  significient  difference 
in  the  results.  The  recognition  and  synthesis  routines 
were  then  run  using  speech  other  than  Seelandt’s.  Using 
a  limited  population  of  test  subjects,  the  results  were 
divided  as  to  which  template  set  produced  the  best  results. 
Those  that  were  familiar  with  the  project  picked  the  four 
vector  templates  as  the  best,  but  those  having  no  experience 
with  speech  recognition  tended  to  pick  the  single  vector 
templates  as  the  best.  During  the  interviews  the  inex¬ 
perienced  listeners  said  they  picked  the  single  vector 
templates  because  there  was  less  spurious  noise  in  the 
speechfile.  The  experienced  listeners  tended  to  pick  the 
four  vector  templates  because  of  the  better  results  in¬ 
volving  the  fricative  and  sibilance  sounds.  Because  of 
this  split,  it  also  showed  that  as  speech  other  than  that 
used  to  pick  the  phonemes  was  used,  it  became  increas¬ 
ingly  more  difficult  to  make  a  consistant  judgement  on 
which  phoneme  template  performed  the  best. 

Recommendations 

In  a  research  project  such  as  this  that  was  so  time 
consuming,  constraints  had  to  be  made  to  keep  it  at  a 
more  managable  size.  Therefore,  the  following  recommenda¬ 
tions  are  made  to  gain  more  insight  into  the  problem  of 
speech  synthesis  and  speech  recognition. 

The  first  recommendation  is  to  analyze  the  utterance 
of  useful  words  such  as  frequency,  step,  CCIP  and  develop- 


ing  a  more  complete  phoneme  template  Incorporating  phonemes 
derived  from  these  words.  This  will  result  in  a  phoneme 
set  that  would  be  more  universal  and  will  be  more  useful 
from  a  practical  point  of  view. 

A  second  recommendation  involves  decreasing  the  pro¬ 
cessing  time  of  Martin's  program  using  multiple  length 
phonemes.  A  routine  needs  to  be  devised  that  will  take 
the  output  of  any  given  phoneme  template  set  that  has  been 
used  in  Martin's  program  and  perform  a  summation  of  the 
distance  using  the  equation: 

k-1 

•■('»)=  ^  'i.k.i+k 

Where:  n  =  phoneme  number 

k  »  number  of  yectors  per  phonemes 

h  =  0  f  1  ,  2  , .  • . . 

i  =  vector  of  phoneme 

j  =  vector  of  speechfile 

I  =  distance  value  in  results  file  from 
Martin's  program 

This  will  preclude  altering  either  the  phoneme  template 
file  or  the  speechfile.  It  should  also  decrease  the  pro¬ 
gram  run  time  and  save  file  space. 

The  third  recommendation  involves  the  synthesized 
speechfile.  An  attempt  should  be  made  to  smooth  out  the 
spectrum  of  the  file  to  avoid  sharp  discrepancies  in  ad¬ 
jacent  vectors.  In  actual  speech  it  is  impossible,  due 


to  the  structure  of  the  vocal  cords,  to  create  sharp  changes 
in  sounds.  There  is  a  transition  between  one  phoneme  to 
the  next  and  this  factor  should  be  simulated  in  synthesized 
speech.  This  can  be  done  by  incorporating  a  decision  routine 
in  the  process  that  examines  groups  of  vectors  for  drastic 
changes  in  spectral  components. 


I 

I 


Bibliography 


It  Potter,  R.  K. ,  George  Kopp,  and  Harriet  Green.  Visible 
Speech..  New  York;  D.  Van  Nostrand  Company,  Inc. ',  1947. 

2.  Seelandt,  Karl  G.  Computer  Analysis  and  Recognition  of 
Phoneme  Sounds  in  Connected  Speech.-  MS  Thesis  GE/EE/81D-53 . 
School  of  Engineering,  Air'  Force  Institute  of  Technology, 
Wright-Patterson  AFB,  Ohio,  December  1981. 

3 .  Montgomery ,  Gerard  J .  Isolated  Word  Recognition  Using 
Fuzzy  Set  Theory.  MS  Thesis  GE/EE/82D-74 .  School  of 
Engineering,  Air  Force  Institute  of  Technolgy,  Wright- 
Patterson  AFB,  Ohio,  December  1982. 

4.  Felkey,  Mark  A.  Automatic  Recognition  of  Phonemes  in 
Continuous  Speech.  MS  Thesis  GE/EE/80D-20."  School  of 
Engineering,  Air  Force  Institute  of  Technolgy,  Wright- 
Patterson  AFB,  Ohio,  December  1980. 

5.  Allen,  Gordon  R.  Expansion  of  the  Nova/Eclipse  Digital 
Signal  Processing  System.  MS  Thesis  GE/EE/82D-16.  School 
of  Engineering,  Air  Force  Institute  of  Technology,  Wright 
Patterson  AFB,  Ohio,  December  1982. 

6.  Martin,  Daviel  L,  Parameter  Analysis  of  the  Speech  Sound. 

MS  Thesis  GE/EE/82D-46 .  School  of  Engineering  Air  Force 
Institute  of  Technology,  Wright-Patterson  AFB,  Ohio, 
December,  1982 

7.  Beachy,  Keith  A.  Computer  Recognition  of  Phonemes  in  the 
Presence  of  Cockpit  Induced  Stress  and  Noise.  MS  Thesis 
GE/EE/82D-20 .  School  of  Engineering,  Air  Force  Institute 
of  Technology,  Wright-Patterson  AFB,  Ohio,  December  1982. 

8.  Devijver,  Pierre  A.  and  J.  Kilther.  Pattern  Recognition; 

A  Statistical  Approach.  London;  Prentice-Hall  Inter¬ 
national,  Inc.,  1982. 

9.  Diller,  Timothy  C.  and  John  F.  Siebenand.  "Speaker 
Independent  Work  Recognition  Using  Sex-Dependent  Cluster¬ 
ing"  ,  IEEE  1981  Acoustiss,  Speech  and  Signal  Processing 
I:  180-183,  1981. 


58 


APPENDIX  A 

COMPUTER 

PROGRAMS 


59 


PROGRAM  speech 


File; 

Language : 

Date: 

Author : 

Subject: 

Calling  Sequence: 


SPEECH 
Fortran  5 
October  19,  1982 
J.  Fletcher 

Phoneme  Speech  file  creation 
SPEECH 


ARGUMENT 

TYPE 

PURPOSE 

I SPEECH 

String 

file  name  of  speech  file 

ILENGTH 

Integer 

vector  length  of  phonemes 

IREC 

Integer 

component  count  for  each  one 

ISIZE 

Integer 

record  size  for  phoneme  file 

IPHON 

Integer 

phoneme  number  selected 

IVECT 

Integer 

starting  vector  for  selected 
phoneme 

INDEX 

Integer 

checks  for  best  vector  in  a 
block 

IBLK 

Integer 

indexes  to  proper  block  of 
speech 

ISPCH 

Integer  Array 

holds  2  blocks  of  speech  read 
from  a  file 

I START 

Integer 

indexes  to  proper  vector  in  a 
block  of  speech 

IARRAY 

Integer  Array 

holds  the  desired  phoneme's 
vectors  pulled  from  TSPCH 

PURPOSE; 

This 

program  is  used  to 

select  the  vectors  or  time  slices 

from  a  speechfile  for  a  user  chosen  phoneme.  The  user  can 
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select  the  phoneme  he  wants  and  the  number  of  vectors  for 
each  phoneme.  The  file  that  is  created  by  this  program, 
PHONFILE,  can  then  be  used  as  a  phoneme  prototype  in 
program  DRVR. 

DESCRIPTION: 

LOCATION:  FLETCHER. DR 

ARGUMENT  STRUCTURE: 

ISPEECH  =  file  name,  13  characters  or  less  in 
Hollerith  string  format 

ILENGTH  *  no  restriction  as  long  as  no  end  of 
file  error  is  returned 

IPHON  =  any  number,  0  will  end  the  program; 

there  is  no  limit  on  how  many  can  be 
created 

IVECT  =  must  be  within  the  bounds  of  the 

speechfile  and  must  be  nonzero 

PROGRAM  USE: 

This  program  can  use  any  speechfile  that  has  been 
created  in  block  format.  This  program  will  select  the 
desired  time  slices  from  the  speechfile  and  put  them  in 
another  speechfile  called  PHONFILE.  The  record  number 
in  PHONFILE  corresponds  to  the  chosen  phoneme  number. 

This  file  can  then  be  run  through  DRVR  to  create  a 
spectrum  file  that  is  used  as  a  prototype  file  in  the 
distance  measurement  subroutines  of  program  DRVR. 


FILES  CREATED: 


PHONFILE: 
record  format, 
phoneme  number; 


digitized  speechfile  created  in  a 
Each  record  number  corresponds  to 
in  this  case,  there  are  71  records 


UOUUUUUUUOUUUtJUUOUUUUUUUUUUUUUU 


PROGRAM:  SPEECH 


WRITTEN  BY: 


CAPT  JANES  E.  FLETCHER 


BATE:  19  OCT  B2 


NOTE:  This  proqraa  is  for  Fortran  5. 


This  proqraa  creates  a  speech  file  of  phoneaes  froa 
existinq  diqitized  speech  files  that  are  in  blocK 
foraat.  This  is  done  by  atloainq  the  user  to  extract 
up  to  5  consectutive  vectors  (tiae  slices)  of  speech 
froa  the  identified  speech  file*  The  vectors  are  then 
stored  in  another  speech  file;  PHONFILE,  as  a  record 
correspondinq  to  the  phoneae  nuaber  selected  by  the  user. 
This  proqraa  also  alloas  the  user  to  select  a  different 
speech  file  by  typinq  “0"  as  a  phoneae  nuaber  and  then 
folloeinq  the  proqraa' s  instructions. 


DIMENSION  I SPEECH (13) ,  ISPCH<512>,  IARRAY (320) 

TYPE“THIS  PROGRAM  CREATES  A  SPEECH  FILE  OF  PHONEMES," 
TYPE“USINC  SELLECTED  VECTORS  FROM  EXISTING  SPEECH  FILES.' 


ACCEPT'NAHE  OF  SPEECH  FILE  TO 
REAB<11,10)  I SPEECH ( 1 ) 
F0RNAKS13) 

CALL  0PEN(1 ,ISPEECH,1 ,IER1 ) 
CALL  CHECEdERll 


BE  USED: 


^Channel  to  speech  file 


ACCEPT"LENGTH  OF  PHONEMES:  ",ILENCTH 
1REC  *  1LENGTH  *  64  {Nuaber  of  coaponents  to  be 
{  extracted  froa  speech  file. 

ISIZE  s  1REC  »  2  {Record  size  for  nea  speech  file 

OPEN  2, "PHONFILE", L£N* ISIZE  {Open  nea  speech  file 

ACCEPT"ENTER  PHONEME  NUMBER  (O'STOP):  ",IPH0N 
IFdPHON  .EQ.  0)  GO  TO  25  {  Check  for  user 

{  aantinq  to  stop 

ACCEPrSTARTING  VICTOR  NUMBER:" ,IVECT{  Beqinninq  vector 

{  of  speech  to  be  extracted 

INDEX  *  HODdVECT ,4)  {Check  for  vector  beinq 

{  last  vector  in  block 
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I1LI  *  ( (IVBCT  ♦  4)  /  4)  -  1  (Indexing  for  proper 

(block 

I F ( INDEX  .EQ.  0)  IBLX- < IVECT/4 > - 1  (If  last  vtctor  of, 

(block  oust  hav*  difforont 
(indexing  equation. 

CALL  RUBLE ( 1 ,IBLX,1SPCH,2,IER2) 

IFUER2  .ME.  1)  TYPE'RDBLI  ERROR  *  ",IER2 

ISTART  *  < ROD ( I VECT , 4 )  -  1)  •  44  (Indexing  to  get  proper 

(  vectors,  using  oodulo  4 

IF (INDEX  .EQ.  0)  1START*3»44  (If  last  vtctor  of  block, 
(■ust  have  different  indexing  equation. 

DO  20  1*1 ,IREC 

IARRAY(I)  *  ISPCH( I+ISTART )  (Put  desired  frageent  of 

(  speech  in  an  array 


CALL  HRITRM(2,IPH0N,IARRAY , & , IER3 > 
IF(IER3  .ME.  1)  TYPE'URITRU  ERROR 
CO  TO  15  (Return  to 


AY,1,1ER3>  (Put  in  PHOHFILE 
TRW  ERROR  *  “,IER3 
(Return  to  get  another  frageent 
(of  speech 


CALL  RESET 

ACCEPT"D0  YOU  KANT  ANOTHER  SPEECH  FILE?(1*Y/0*N) 

IF ( IANS  .EQ.  1)  CO  TO  5 

STOP 

END 


,IANS 
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PROGRAM  TALK2 


Files 

TALK2 

Languages 

Fortran 

5 

Date: 

October 

19,  1982 

Author 

K.  Seelandt  modified  by  J.  Fletcher 

Subject: 

Speech  Synthesis 

Calling  Sequence: 

TALK2  Input  file  Output  file 

ARGUMENT 

TYPE 

PURPOSE 

OUTFILE 

String 

file  name  of  resulting 
synthesized  speech 

OUT3 

String 

file  name  of  top  phoneme 
choice  from  the  recog¬ 
nition  programs 

JUNK,  JUNK2 
JUNK3 ,  JUNK4 

Integers 

time  slice  numbers  from 

OUT 3  file 

P1,P2,P3,P4 

Integers 

top  phoneme  choice  for 
four  consecutive  time 
slices 

SPEECHOUT 

Integer  Array 

holds  the  corresponding 

speech  time  slices  for 
chosen  phonemes  for 
OUTFILE 


SPEECHIN  Integer  Array  holds  the  corresponding 

speech  time  slices  from 
file  PHONEMES 

PURPOSES 

This  program  is  used  to  create  a  speechfile  that  has 
been  synthesized  using  the  top  phoneme  choice  from  a  recogni 
tion  routine,  for  each  time  slice.  The  input  file  is  a 
binary  of  these  top  choices. 


DESCRIPTION: 


LOCATION:  FLETCHER. DR 

ARGUMENT  STRUCTURE: 

OUTFILE  =  name  of  the  speechfile  to  be  created  in 
a  7  character  Hollerith  string  format 

OUT3  =  name  of  the  input  binary  file  holding 

the  top  choices  in  a  7  character 
Hollerith  string  format 

PROGRAM  USE: 

This  program  is  used  to  give  an  audible  indication  of 
how  well  the  recognition  routine  worked  in  choosing  the 
correct  phoneme.  The  program  uses  a  speechfile,  PHONEMES, 
of  phonemes  created  in  record  format.  The  phoneme  value, 
from  OUT 3 ,  for  each  time  slice  is  then  used  to  pick  the 
corresponding  speech  phoneme  and  these  are  strung  together 
to  create  a  new  speechfile  in  block  format. 

FILES  CREATED: 

OUTFILE:  file  name  is  user  defined;  holds  the 

synthesized,  digitized  speech  and  is  written  in  block 
format . 


nonnnnnnnnoonnn  nnoonnonnr>nnor>r»on 


PROCRAB:  T  A  L  X  2 


WRITTEN  BY* 

LT,  KARL  SEELANDT 
HOIIPIED  BY* 

CAPT  JANES  E.  FLETCHER 

BATE:  19  OCT  B2 


COMPILE  IN  FORTRAN  V 

This  proqraa  is  used  to  create  digitized 
speech  using  an  0UT3  file  such  as  the  one  created  by 
LISTER4  (ref*  Seelandt's  prograes)*  Speech  gener* 
ation  is  done  using  a  speech  file  of  the  single  vector 
phoneees  stored  individually  as  records  corresponding  to 
their  phonene  nuebers. (This  file  is  PHONENES*)  The 
top  choice  froa  the  recognition  routine  for  each  tine 
slice  is  stored  in  0UT3*  This  progran  then  reads 
that  top  choices  and  places  the  corresponding  phoneae 
in  another  speech  file*  Since  this  is  done  sequentially, 
a  phoneae  at  a  particular  tine  slice  corresponds  directly 
to  the  top  phoneae  choice  for  a  particular  tiae 
slice  listed  in  0UY3* 


IMPLICIT  INTEGER  (A-Z)  'Alt  variables  are  integers 
DIMENSION  Ml(7) 

INTEGER  0UTFILE(7>,  SPEECH I N 1 ( 6A ) ,  SPEECH0UT(25d> , 
SPEECHIN2U4),  SPEECH  1 H3 ( 64 ) ,  SPEECHIN4 ( 64 ) ,  0UT3(7) 

CALL  I0F(2,H1 ,0UT3,0UTFIIE,I1,M2,I2,I3,I4) 

CALL  OPEHd, "PHONEMES",!,  128,  IERO) 

IFdERO  .HE.  1)  TYPE"ERROR  OPENING  FILE  •  “,IER0 
OPEN  2,0UT3  {Open  channel  to  0UT3. 

DELETE  OUTFILE  {Make  sure  there  is  no 

CALL  CFILM(0UTFILE,2,IER)  {and  create  specified  file. 

CALL  CHECK ( I ER ) 

OPEN  3, OUTFILE 

COUNT  *  0  ?Index  used  to  increaent  block  nuaber  in 

{speech  file 


READ  BINARY(2,ENB*20)  JUNE,  Pi,  JUNX2,  P2, 

JUNX3,  P3,  JUNE4,  P4  {0UT3  file  is  set  up  listing  tine 
{slice  nuaber  and  corresponding  top 
{phoneae  choice* 

CALL  REABRVd ,P1,3PEBCHIH1}1,JER)  {Read  phoneae 

IF( JBR  *NE»  1)  TYPE-JER  *  ",JBR  {froa  phoneae 
CALL  REABRVd, P2,SPEECHIN2,1,IER)  {speech  file 
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IF C EBR  .NS.  1)  TYPE“IER  *  “,1ER 
CALL  RE ADRM(1,P3, SPEECH IN3,1,LER) 

IF  1 LER  .HE.  1)  TYPE'LER  *  “,LER 
CALL  READRH(1,P4,SPEECHIH4,1,HER) 

IF (HER  .HE.  1)  TYPE'HER  >  “,NER 
C 
C 

DO  10  1*1,64 

SPEECHOUT ( I )  *  SPEECHIHl(I)  {Put  the  four  phonetet 
SPEECHOUT( 1+64)  *  SPEECHIN2U)  {into  one  array  to 
SPEECHOUTU+128)  *  SPEECHIH34 1 >  {create  a  block 
SPEECHOUTf 1+192)  *  SPEECHIH4( I )  {of  speech. 

10  CONTINUE 

CALL  HRBLK3, COUNT, SPEECHOUT, 1, HER)  {Put  block  of  speech 
IFfNER  »NE.  1)  TYPE"HRBLK  ERROR:  “ ,NER  {in  speech  file. 
COUNT  *  COUNT  +  1  {Increment  block  counter 

C 
C 

GO  TO  1  {Start  over  and  continue  until  coae  to 

{end  of  0UT3. 

20  URITEt 10,30)  OUTFILE(l) 

30  FORHATCFILE  *,813,'  HAS  BEEN  CREATED. “) 

CALL  RESET 

STOP 

END 
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PROGRAM  PLOTPHON 


File: 

PLOTPHON 

Language : 

Fortran 

5 

Date: 

September  28,  1982 

Author 

J.  Fletcher 

Subject: 

phoneme 

recognition 

Calling  Sequence:  PLOTPHON 

Subroutines 

used:  RDOUT1, 

RDOUT5 ,  GRPH2 

ARGUMENT 

TYPE 

PURPOSE 

INCHOICE 

INTEGER 

For  choosing  a  routine  to  read 
either  an  OUTl  or  OUT 5  file 

OUTl 

string 

file  name  to  be  read 

TITLE 

string 

Information  to  be  printed  with 
graph  gives  words  spoken  and 
speaker 

IPHON 

Integer 

Phoneme  number  to  be  plotted 

IPRINT 

Integer 

For  choosing  whether  or  not  to 
print  out  amplitude  file. 

PURPOSE : 

This  program  is  used  to  show  graphically  when  a  user 
chosen  phoneme  is  picked  as  a  top  five  choice  in  a  speech- 
file.  The  graph  is  plotted  on  the  Tektronix  4010-1  Graphic 
Terminal  along  with  the  phoneme  number,  the  file  used,  the 
words  spoken,  and  the  speaker. 

DESCRIPTION: 

LOCATION:  FLETCHER.  DR 


ARGUMENT  STRUCTURE; 

ICHOICE  =  1  for  choosing  an  0UT1  file  or  0  for  choosing 
an  OUT 5  file 

0UT1  =  the  desired  file  to  be  used  in  Hollerith 

string  format 

TITLE  =  works  spoken  and  speaker  in  Hollerith  string 

format 

TP HONE  =  01  to  71  for  OUT1  files  or  001  to  071  for 

OUT 5  files 

IPRINT  =  1  to  print  resulting  amplitude  file,  0  other¬ 
wise 

PROGRAM  USE; 

This  program  can  use  the  recognition  results  for  a  given 
speechfile  that  has  been  run  through  either  Seelandt's 
recognition  routine,  TRYDIST5  and  LISTER4 ,  or  Martin's  recog¬ 
nition  routine,  DRVR.  The  top  five  phoneme  choices  from 
Seelandt's  routine  are  stored  in  file  OUT1.  File  OUT5  is 
created  by  taking  the  file  of  Martin's  top  five  choices  and 
running  it  through  program  TOP5.  OUT1  and  OUT5  can  have  any 
file  name  and  are  referred  to  as  OUT1  and  OUT5  only  to  dif¬ 
ferentiate  between  the  two  types  of  files  since  they  are 
formatted  differently. 

In  order  to  load  this  program  the  load  line  must  be: 

RLDR  PLOTPHON  RDOUTl  RD0UT5  GRPH.LB 
@FLIB@ 

Make  sure  you  have  linked  to  GRPH.LB  which  is  in  directory 
DP4F. 
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PROGRAM  PLOTPHON 
WRITTEN  BY :  JAMES  E.  FLETCHER 
DATE:  26  SEP  62 


NOTE:  This  prograa  is  for  Fortran  3. 

PLOTPHOH  usts  a  file  of  the  top  five  phoneae  choices  and 
their  associated  scores  for  each  tiae  slice. 

These  files  are  the  QUT1  files  created  by  PHDIST  and  CHOICES 
and  the  0UT5  files  created  by  DRVR,  its  associated  subroutines 
and  by  LISTTOPS.  PLOTPHOH  searches  each  tiae  slice  for  a  user 
selected  phoneae.  If  one  is  found  it's  associated  score  is 
placed  in  an  array.  If  one  is  not  found  for  that  tiae  slice,  a 
zero  is  placed  in  the  array.  The  array  of  scores  is  then  noraal- 
ized  to  a  value  betaeenone  and  zero.  The  scores  are  then  plot¬ 
ted  on  a  graph  aith  the  phoneae  score  on  the  Y-axis  and  the  tiae 
slice  nuaber  on  X-axis.  This  prograa  utilizes  three  subroutines: 

GRPH.LB,  RD0UT1,  RD0UT5.  Subroutine  RD0UT1  reads  0UT1  files, 
searches  for  the  desired  phoneae  and  its  score,  noraalizes  that 
score  and  returns  the  array  of  scores  to  the  aain  prograa. 

Subroutine  RD0UT5  does  everything  RDOUTldoes  but  reads  OUTS 
files  instead  of  0UT1  files.  GRPH.LB  is  a  packaged  library 
subroutine  that  plots  graphs.  A  paaphet  is  available  for  GRPH.LB. 

NOTICE:  This  prograa  should  used  only  on  the  TEKTRONIX  6010 
teainal . 

DIMENSION  SCARRAYI400),  0UTK13),  TITLEOO),  ISAID(SO),  ISPEAKRU3) 

I0PT  *  0 

TYPE“THIS  PROGRAM  CAN  USE  TWO  TYPES  OF  FILES," 

TYPE”EITHER  0UT1  OR  0UT5  FILES," 

1  ACCEPT'  DO  YOU  WANT  AN  0UT1  OR  0UT5  F I LE? ( 1  * OUT 1 /0 * 00T5 ) " , ICHO I CE 

2  ACCEPT "NAME  OF  FILE:  ' 

READ (11 ,3)  0UT1U) 

3  FORMAT ( S 1 3 ) 

ACCEPTHWORDS  SPOKEN  AND  SPEAKER:  " 

READ (11,4)  TITLE(l) 

4  FORMAT ( S30 ) 

5  CALL  0PEN(2,0UTl,l,IERl)i  This  altoes  the  coaputer  to  read  the  0UT1  file. 
CALL  CHECK(IERl>i  Check  for  an  error  during  file  opening. 

TYPB"NOTBi  " 

TYPE"  TO  CONTINUE  AFTER  GRAPH  HAS  BEEN  D1SPAYED," 

TYPB"  HIT  ANY  KEY.  - 
IFdOPT  .EQ.  2)  60  TO  9 

IFdCHOlCE  .EQ,  1)  60  TO  7)  Phoneae  foraatted  differently  for 

lOUTl. 

ACCEPT'ENTER  PHONEME  NUMBER  (001  TO  071):  " 

READ (11 ,6)  IPHON 


71 


u  a  a  o 
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FORMAT (13) 

CO  TO  9 

7  ACCEPT'EMTER  PHONEME  HUMBER  (01  TO  71) s  " 

READ (11/6)  IPHON 
6  FORMAT ( A2 ) 

9  ACCEPT"DO  YOU  WANT  TU  PRINT  OUT  AHPL«  FILE?  < 1*Y/0=H)  “,IPRIMT 
IMIH  =  600;  No  score  should  ever  exceed  this  value. 

INAX  *  0 

ICONT  *  0$  Initializes  flag  for  counting  no.  of  tiees  phoncee 
;  appears  in  0UT1  or  OUTS  file. 

IFUCHOICE  .KQ.  1)  CALL  R00UT1 (MAX , MIN, ICONT ,SCARRAY ,IVECT , 

/  IPHON, IMAX, ININ) 

IF ( ICHOICE  .EQ.  0)  CALL  RDOUTS(MAX ,MIN, ICONT, SC ARRAY, I VECT , 

/  IPHON, IHAX, IHIN) 

60  IFdCONT  .CT.  1)  GO  TO  65 

WRITE(10,6i)  IPHON,  ICONT ff  phoneee  appears  .LT.  teo  tiees 

61  FORHAT(// ,A2,"  APPEARS  ONLY  ",I1,“  TIME(S).");  the  prograe  is 
TYPE"PROGRAH  UNABLE  TO  GENERATE  AN  OUTPUT.";  unable  to  plot 

;a  graph. 

ACCEPT“TYPE  ANY  NUMBER  TO  CONTINUE.  ",IPAUS 
GO  TO  90 
65  CONTINUE 

DO  70  I*1,IVECT;  Perfore  the  norealization 

IF(SCARRAYd)  .GT.  0)  SCARRAY (I ) = ( (SC ARRAY ( I ) -MIN)/ (MAX-MINI ) 

70  CONTINUE 

IFdPRIHT  .HE.  1)  GO  TO  82;  Skip  if  don't  eant  array  printed. 
IFdCHOICE  .EQ.  1)  GO  TO  74;  Phoneee  foreatted  differently  for 

;0UT1. 

WRITE(12,71)  IPHON 

71  FORMAT!"  AMPLITUDES  FOR  PHONEME  ",I3,//>;  Header  info  for 

URITEt 12,75)  IPHON  printout 

75  FORMAT (“  AMPLITUDES  FOR  PHONEME  “,A2,//);  Header  info  for 

IIRITE(  12 ,76)  OUTl(l)  printout 

76  FORMAT!"  FROM  FILE  ",S13,//>;  Header  info  for  print  out. 
URITEt 12 ,60 )  (SCARRAY (I ) ,  I-1,IVECT);  Print  hard  copy  of  score 

60  F0RHAT(5X,F6.3,5X,F6»3,5X,F6.3,5X,F6.3,5X,F6.3)  ;array 

WRITE! 12,61 )  MAX,  MIN 

81  FORMAT!//, 5X, "MAX  SCORE  =  ”,F4,0,5X,"HIN  SCORE  *  “,F4.0) 

62  CALL  GRPH2! “PHONEME  AHPL." ,1, SCARRAY ,U,IVECT,0 ,XM1N,XMAX,0) 


GRPH2  is  the  subroutine  that  plots  the  norealized  scores  on 
a  graph. 
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65 

66 
67 

66 

96 


IFdCHOICE  .EQ.  1)  GO  TO  84 

UR1TE 1 10 ,83)  IPHON,  OUTl(l);  Header  info  for  graph. 
FORHATTAMPLITUDES  FOR  PHONEME  >  “,I3,”  FROM  FILE  “,S13> 
GO  TO  66 

URITE!  10,65)  IPHON,  OUTld);  Header  info  for  graph. 
FORMAT (“AMPLITUDES  FOR  PHONEME  »  ",A2,”  FROM  FILE  “,S13) 
URITE! 10,87)  TITLE! 1 ) g  Header  info  for  graph. 

FORMAT (“WORDS  SPOKEN  AND  SPEAKER:  “,S30> 

READ(11 ,86)  IPAUSE 

FORHAT(Sl);  Pause  until  user  ready  to  clear  sceen. 

CALL  RESET 
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CALL  ERS(l);  Subroutine  to  erase  the  screen* 

CALL  TDELAY(2){  belay  to  allot  tine  to  clear  screen  before 
^continuing* 


I 


I 


TYPE'QPTIONSs* 

TYPE*  1.  USE  SAKE  FILE,  BUT  DIFFERENT  PHONEME." 

TYPE*  2.  CET  DIFFERENT  FILE,  BUT  USE  SANE  PHONEME. (MUST  BE" 
TYPE*  SANE  TYPE  FILE,  EITHER  0UT1  OR  0UT5>" 

TYPE"  3.  CET  DIFFERENT  FILE  AND  DIFFERENT  PHONEME." 

TYPE"  4.  END  PROGRAM." 

ACCEPT'ENTER  OPTION  NUMBERS  ",IQPT 


IF(IOPT 

.EQ. 

1) 

CO 

TO 

5 

I F 1 1 OPT 

.  EQ. 

2) 

CO 

TO 

2 

IF ( IOPT 

•  EQ. 

3) 

GO 

TO 

1 

STOP 

END 


i  *• 


H 
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ft 


i 
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SUBROUTINE  RDOUT1 


Files 


RDOUT1 


Language : 
Date: 
Author : 


Fortran  5 

September  28,  1982 
J.  Fletcher 


Subject:  Reading  output  file  of  program  CHOICE 

Calling  Sequence:  Call  RD0UT1  (MAX,  MIN ,  INCONT,  SCARRAY, 

IVECT,  IPHON,  IMAX,  IMIN} 


ARGUMENT 

TYPE 

PURPOSE 

MAX 

Real 

Maximum  distance  for  specific 
phoneme  in  speechfile 

MIN 

Real 

Minimum  distance  for  specific 
phoneme  in  speechfile 

ICONT 

Integer 

Count  of  the  number  of  times 
specific  phoneme  shows  up  in 
output  file 

SCARRAY 

Real  Array 

Array  of  distances  for  specific 
phoneme 

IVECT 

Integer 

Time  slice  number  where  phoneme 
was  chosen  as  a  top  five  choice. 

IPHON 

Integer 

Selected  phoneme  number  for  which 
distances  are  taken  from  output 
file 

IMAX 

Integer 

Maximum  distance  for  specific 
phoneme  in  speechfile 

IMIN 

Integer 

minimum  distance  for  specific 
phoneme  in  speechfile 

PURPOSE: 


This  subroutine  is  used  in  program  PLOTPHON  and  reads 


the  value  from  the  output  file  of  program  CHOICE ,  0UT1. 


DESCRIPTION; 


Loca t ion ;  FLETCHER ,  DR 
ARGEMENT  STRUCTURE: 

MAX 


MIN 

ICONT 

SCARRY 

IVECT 

IPHON 

IMAX 

IMIN 

PROGRAM  USE: 


=  Used  to  compute  the  amplitude  of  the  dis¬ 
tances  in  main  program. 

=  Used  to  compute  the  amplitude  of  the  dis¬ 
tances  in  main  program 

=  Used  as  a  flag  to  notify  user  there  are  not 
enough  distance  values  to  plot  a  graph. 

=  Passes  distance  values  to  main  program: 
these  will  be  converted  to  distance 
amplitudes . 

»  Used  in  graph  routine  to  number  points  on 
X-axis. 

=  Can  be  an  integer  from  01  to  71;  used  to 
pick  out  corresponding  distances  in  OUT1 
file 

=  Initial  maximum  value  passed  to  subroutine 

=  Initial  minimum  value  passed  to  subroutine 


This  subroutine  is  to  be  used  with  its  main  program, 
PLOTPHON.  It  is  to  be  loaded  in  the  fashion  illustrated  in 
the  documentation  for  program  plotphon.  It  will  only  read 
a  file  of  the  top  five  phoneme  choices  that  has  been  for¬ 
matted  in  the  same  manner  as  the  0UT1  file  created  by  program 
CHOICE. 
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SUBROUTINE  RBOUTHNAX, HIN, ICONT, SCARRAY, IVECT, IPHON, INAX, ININ) 


NOTE:  This  prograa  is  for  Fortran  5. 

Routine  RD0UT1  is  used  by  the  aain  prograa  PLOTPHON. 

It  uses  the  initialized  flag,  ICONT;  the  desired  phoneae,  IPHON, 
and  the  initialized  aaxiaua  and  ainiaua  scores/  IKAX  and  ININ* 

It  returns  to  the  aain  prograa  the  updated  floating  point 
aaxiaua  and  ainiaua  scores/  NAX  and  HIN;  the  updated  flap/ 

ICONT;  the  nuaber  of  tiae  slices  in  the  0UT1  file/  IVECT/  and 
the  array  of  scores*  R00UT1  goes  to  the  0UT1  file  and  searches 
for  the  desired  phoneae*  Upon  locating  it/  it  places  its 
score  in  the  array  at  the  corresponding  tiae  slice  nuaber* 

Every  tiae  it  finds  the  desired  phoneae  it  also  increaents 
ICONT*  It  also  checks  to  see  if  the  score  is  a  nee 
aaxiaua  or  a  nea  ainiaua. 


DIHENSION  SC ARRAY ( 400 > 

BO  8  1*1/400 
SCARRAYU)  *  0 
CONTINUE 

READ(2/20/EHB:60)  IVECT,  IPH0N1,  IVAL1,  IPH0N2,  IVAL2,  IPH0N3, 


IVAL3,  IPH0N4,  IVAL4,  IPH0N5,  IVAL5  ;  Read  in  phoneaes  8  scores 
F0RHAT(5X,I4,3X,A2,"  ",I3,2X,A2,"  " ,I3,2X,A2,“  " ,13/ 

2X,A2,“  "/I3/2X/A2,"  “ ,13) 

These  if  stateaents  check  to  see  if  any  of  the  phoneaes  Batch 
the  desired  phoneae*  If  a  natch  is  found,  its  score  becoaes 
ISCORE  and  ICONT  is  increaented. 

IF (IPHON  «EQ.  IPH0N1)  CO  TO  30 
IFUPH0N  *EQ,  IPH0N2)  CO  TO  31 
IFdPHOH  *EQ.  IPH0N3)  CO  TO  32 
IF ( IPHON  *EQ.  IPH0N4)  GO  TO  33 
IFdPHON  «E0.  IPH0N5)  CO  TO  34 


CO  TO  10 
ISCORE  « 


IVAL1 


ICONT  ■  ICONT  ♦  1 
CO  TO  40 
ISCORE  *  IVAL2 
ICONT  *  ICONT  ♦  1 
CO  TO  40 
ISCORE  *  IVAL3 
ICONT  '  ICONT  ♦  1 
CO  TO  40 
ISCORB  *  IVAL4 
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I CONY  «  ICONT  ♦  1 
60  TO  40 
ISCORE  *  IVAL5 
ICONT  -  ICONT  ♦  1 

IFUSCORE  .CT.  I H AX >  IHAX  *  ISCORE;  Sets  IHAX  to  the  highest 

;score 

IFUSCORE  .LT.  ININ)  ININ  *  ISCORE;  Sets  ININ  to  the  loeest 

;score. 

SCORE  *  FLOAT< ISCORE) ;  Change  to  floating  point 
NAX  :  FLOAT(IHAX)  ;nuebers 
NIN  *  FLOAT! ININ) 

SCARRAY(IVECT)  *  SCORE;  Put  score  into  array* 

GO  TO  10;  Continue  until  end  of  file  is  reached* 

RETURN 

END 


Jl 
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SUBROUTINE  RDOUT5 


File: 

RDOUT5 

Language 

:  Fortran 

5 

Date: 

September  28,  1982 

Author: 

J.  Fletcher 

Subject: 

Reading  output  file  of  program  TOPS 

Calling 

Sequences:  Call  RD0UT5  (MAX,  MIN,  ICONT,  SCARRAY, 

IVECT,  IPHON,  IMAX,  IMINJ 

ARGUMENT 

TYPE 

PURPOSE 

MAX 

Real 

Maximum  distance  for  specific 
phoneme  in  speechfile 

MIN 

Real 

Minimum  distance  for  specific 
phoneme  in  speechfile 

ICONT 

Integer 

Count  of  the  number  of  times 
specific  phoneme  shows  up  in  out¬ 
put  file 

SCARRAY 

Real  Array 

Array  of  distances  for  specific 
phoneme 

IVECT 

Integer 

Time  slice  number  where  phoneme 
was  chosen  as  a  top  five  choice 

IPHON 

Integer 

Selected  phoneme  number  for  which 
distances  are  taken  from  output 
file 

IMAX 

Integer 

Maximum  distance  for  specific 
phoneme  in  speechfile 

IMIN 

Integer 

Minimum  distance  for  specific 
phoneme  in  speechfile 

PURPOSE: 

This  subroutine  is  used  in  program  PLOTPHON  and  read  the 
values  from  the  output  file  of  program  T0P5,  0UT5. 


DESCRIPTION; 


Location;  FLETCHER.  DR 


ARGUMENT  STRUCTURE: 


MAX 

MIN 

ICONT 

SCARRAY 

IVECT 

IPHON 

IMAX 

IMIN 

PROGRAM  USE: 


-  Used  to  compute  the  amplitude  of  the  dis¬ 
tance  in  main  program 

=  Used  to  compute  the  amplitude  of  the  dis¬ 
tance  in  main  program 

=  Used  as  a  flag  to  notify  user  there  are  not 
enough  distance  values  to  plot  a  graph 

=  Passes  distance  values  to  main  program; 
there  will  be  converted  to  distance  ampli¬ 
tudes 

=  Used  in  graph  routine  to  number  points  on 
X-axis 

=  Can  be  an  integer  from  02  to  71;  used  to 
pick  out  corresponding  distances  in  OUT1 
file 

=  Initial  maximum  yalue  passed  to  subroutine 

=  Initial  minimum  value  passed  to  subroutine 


The  subroutine  is  to  be  used  with  the  main  program, 
PLOTPHON.  It  is  to  be  loaded  in  the  same  fashion  as  out¬ 
lined  in  the  documentation  for  program  PLOTPHON.  It  performs 
the  same  function  as  subroutine  RD0UT1,  but  will  only  read  a 
file  formatted  in  the  same  manner  as  the  OUT5  file  created 
by  program  TOP 5. 


non  nnnnnnonononnnnooonnonono 


SUIROUTINE  RD0UT5(HAX,HIN,IC0NT,BCARRAY,IVECT,IPH0N,IHAX,IHIH) 


ROUTINE  II  10  II  !  5 
WRITTEN  BY:  JANES  E.  FLETCHER 
DATE:  21  SEP  62 


NOTE:  This  prograa  is  for  Fortran  5. 

Routine  RD0UT5  is  used  by  the  eain  prograa  PL0TPH0H. 

It  uses  the  initialized  flag,  ICONT;  the  desired  phoneae,  IPHON, 
and  the  initialized  aaxiaua  and  ainiaua  scores,  IHAX  and  ININ. 

It  returns  to  the  aain  prograa  the  updated  flag,  ICONT?  the 
nuaber  of  tiae  slices,  IVBCT,  in  the  OUTS  file  and  the  array 
of  scores.  RD0UT5  goes  to  the  0UT5  file  and  searches  for  the 
desired  phoneae.  Upon  locating  it,  it  places  its  score  in  the 
array  at  the  corresponding  tiae  slice  nuaber.  Every  tiae  it 
finds  the  desired  phoneae  it  also  increaents  ICONT.  It  also 
checks  to  see  if  the  score  is  a  nea  aaxiaua  or  a  nea  ainiaua. 


DIHENSION  SC  ARRAY  ( 400  > ,  ISA1DI30),  ISPEAXRU3) 

DO  6  1*1,400 
SCARRAY (I)  *  0 
6  CONTINUE 

The  folloaing  seven  READ  stateaents  just  reads  off  the  0UT5 
file  header  to  get  you  into  the  aain  body  of  the  file. 

READ(2,11 )  ISAID(l) 

11  F0RNAT(15X, “SENTENCE  SP0IEN  :  ”,S50,/> 

READ (2,12)  ISPEAKR(l) 

12  F0RNAT ( IPX, "SPEAKER  HAS  :  “,S13,/> 

RE AD C 2 ,13)  IDATE 

13  F0RHAT(15X,"THE  DATE  IS--" ,213,15) 

READ ( 2 , 1 4 >  IH0UR ,  ININ,  ISEC 

14  F0RNATI 15X,“THE  TINE  IS- ,313,//) 

READ  <  2 , 15  > 

READ  12,16) 

READ (2,17) 

Ceeeeeee 

15  F0RNAT(4X, “VECTOR", 4X, "FIRST", 3X, "SECOND", 4X, "THIRD", 

/  3X, "FOURTH", 4X, "FIFTH", 7X, "SCALE", 4X, "VECTOR") 

16  F0RNAT(4X,"NUNBER",3X, "CHOICE", 3X, "CHOICE", 3X, "CHOICE", 

/  3X, "CHOICE", 3X, "CHOICE", 6X, "FACTOR", 6X, "ENERGY") 

17  FDRNATIAXj'HHH’^Xj’HiHH'jSX^mHir^Xj'imm", 

/  2X, "*tt#ee»“,2X,"»»»eee#",5X, "eeeteee", 3X, "eeeeeee",/) 

10  READ(2,20,END*60)  1VECT,  IPH0N1,  IVAL1,  IPH0N2,  IVAL2,  IPK0N3, 

/  IVAL3,  IPH0N4,  IVAL4,  IPH0N5,  IVAL5{  Read  in  phoneaes  and 

^scores. 

FORNAT ( 5X ,14 ,3X / 1 3 ,"  ",13,2X,I3,"  ",I3,2X,I3,"  ",I3, 


80 


20 


I 


E  I* 


C 

C 

c 

c 


30 


31 


32 


33 


34 


40 


60 


2X/i3,“  a,i3,n,n,m  *#i3) 

These  IF  stataaents  check  to  set 
the  desired  phoneae.  If  a  aatch 
ISCORE  and  ICONT  is  increaented. 


if  any  of  the  phoneaes  aatch 
is  found,  its  score  becoaes 


IFdPHOH  .EQ. 
IFdPHOH  .EQ. 
IFdPHOH  .EQ. 
IFdPHOH  .EQ. 
IFdPHOH  .EQ. 

CO  TO  10 
ISCORE  *  IVALl 
ICONT  *  ICOHT  ♦  1 
CO  TO  40 
ISCORE  *  IVAL2 
ICONT  *  ICONT  ♦  1 
GO  TO  40 
ISCORE  *  IVAL3 
ICOHT  =  ICOHT  +  1 
GO  TO  40 
ISCORE  *  IVAL4 
ICONT  -  ICONT  ♦  1 
GO  TO  40 
ISCORE  *  IVALS 
ICONT  *  ICONT  ♦  1 
IF( ISCORE  .GT.  I H A X > 


IPH0H1)  GO  TO  30 
IPH0H2)  GO  TO  31 
1PH0N3)  GO  TO  32 
IPH0N4)  GO  TO  33 
IPH0N5 )  GO  TO  34 


I H AX  =  ISCORE? 


Sets  INAX  to  the  highest 
?score. 

IF ( ISCORE  .LT.  II4IN >  ININ  -  ISCORE?  Sets  ININ  to  the  lonest 

?score. 

change  score  to  floating  point 
?nuaber . 


SCORE  ;  FLOAT (ISCORE) ? 
NAX  *  FLOAK  INAX) 

HIN  *  FLOAT! ININ) 
SCARRAY (IOECT)  *  SCORE? 
GO  TO  10?  Continue  unti) 
RETURN 
END 


Put  SCORE  into  array, 
end  of  file  is  reached. 


i 
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PROGRAM  GENOUT3 


File: 

GEN0UT3 

Language : 

Fortran 

5 

Date: 

November  2,  1982 

Author : 

J.  Fletcher 

Subject: 

Generation  of  top  choice  file 

Calling  Sequence:  CHNGFILE 

ARGUMENT 

TYPE 

PURPOSE 

IFILE 

String 

file  name  of  top  5  phoneme 
distances 

IHEADER 

Integer  Array 

leader  information  of  top  5 
file 

ILAST 

Integer 

number  of  time  slices  in  speech- 
file 

IRRAY 

Integer  Array 

holds  one  block  information 
from  top  5  file 

ISLICE 

Integer 

time  slice  number  of  top  5 
phoneme  choice 

I VAL1  - 1 VAUL  2 

Integer 

unused  values  from  top  5  file 

IPHON 

Integer 

top  phoneme  choice  for  a  time 
slice 

DESCRIPTION: 

Location: 

FLETCHER.  DR 

ARGUMENT  STRUCTURE: 

IFILE  =  name  of  top  5  phoneme  distances  from  DRVR: 

formatted  in  13  character  Hollerith  string 

PROGRAM  USE: 


This  program  creates  an  0UT3  file  to  be  used  as  an  input 


file  for  program  TALK2.  It  uses  a  file  created  by  DRVR  when 
the  option,  choose  5  best  distances,  is  used. 

FILES  CREATED: 


TEMFILE 


=  contains  everything  that  is  in  IFILE 

except  the  header  block:  written  in  block 
format 

=  contains  the  time  slice  and  corresponding 
top  phoneme  choice:  written  in  WRITE 
BINARY  format 


OUT  3 


ono  oo  oooor>r>oor»r»r»r>r»oor»oooooo 
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PROGRAM:  6  E  N  0  U  T  3 

«« 

- J 

II 

•s 

•e 

BY:  CPT  JANES  E.  FLETCHER 

ee 

1 

»» 

•• 

ii 

DATE:  2  NOV  B2 

•« 

•» 

** 

NOTE}  THIS  IS  FOR  FORTRAN  5. 

This  prograa  takes  the  file  created  by  DRVR  for  the  five  best 
distances  when  a  speech  file  is  banged  against  a  phoneee  file  and 
creates  an  0UT3  file.  The  0UT3  file  it  a  list  of  the  tine  slices  in 
a  speech  file  and  its  corresponding  top  phoneee  choice* 

The  0UT3  file;  once  created ,  can  non  be  run  in  progran  TALX2  to 
resynthesize  the  speech  recognized  by  DRVR.  This  gives  an  audible 
indication  of  hoe  eel  I  DRVR  did  at  recognizing  speech. 


DIHENSION  I ARRAY ( 256 ) ,  IHEADERt 256) 

ACCEPT”F1LE  NAHE  OF  TOP  5  DISTANCES  FRON  DRVR:  * 

RE AD ( 1 1 >5 )  IFILE 
5  FORMAT ( S 1 3 ) 

CALL  0PEN(1,IFILE,1,IER1> 

CALL  CHECK ( IER 1 ) 

CALL  F0PENI2, "TENFILE")  ;  File  used  to  temporarily  store 

{the  top  5  choices  minus  the  header  block. 
CALL  RDBLIfl ,0,1HEADER,1 , 1 ERO > 

CALL  CHECK (IERO) 

ILAST  *  IHEAOER ( 41 >  ;Value  for  »  of  tine  slices  in  speech  file. 

TYPE" ILAST  *  ”, ILAST 

ISTART1  *  1  ;  Start  block  for  reading  top  5  file. 

ISTART2  *  0  5  Start  block  for  eriting  to  TENFILE. 

10  CALL  RDBLX( 1 , ISTART1 , I ARRAY , 1 , IER2 )  ;  Read  top  5  file. 

IF ( IER2  .EQ.  9)  CO  TO  20 

ISTART1  *  1START1  +  1  ;  Increment  to  neat  block. 

CALL  URBLX<2,ISTART2,IARRAY,1 , 1 ER3 )  ;  Write  to  TENFILE. 

CALL  CHECK ( IER3 ) 

ISTART2  -  ISTART2  ♦  1  {Increnent  to  next  block. 

CO  TO  10 

20  CALL  RESET 


TYPE"TEHFILE  HAS  BEEN  CREATED." 

CALL  F0PEN(2, "TENFILE") 

CALL  BFILHC0UT3",IER3) 

IF(IER5  .HE.  1)  TYPE  ”IER3  «  *,IER3 
CALL  F0PBN(3,"0UT3") 

A  string  of  values  is  read  in  to  get  the  tine  slice  nueber 
(ISL1CI)  and  phoneee  nueber  (IPHON).  Reading  in  the  string 
of  values  after  IPHON  sets  you  up  to  read  in  the  next  tine 
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■ 
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s)ic«  nuaber. 

30  READ  BINARY(2,ENDM0)  ISLICE,  IVAL1,  IVAL2,  IVAL3,  IPHOH,  IVAL4 , 

/  1VAL3 ,  IVAU,  1VAL7 ,  IVAL8,  IVAL?,  IVAL10,  IVAlll,  IVAL12 
WRITE  BINARY < 3 )  ISLICE,  IPHOH  (Write  tiae  slice  4  its  top 

;  phoneae  to  QUT3. 

IPdSLICE  .EQ.  HAST)  GO  TO  40  ;  ChecK  for  last  tiae  slice  nueber  (In  case 

;  top  5  file  overruns  speech  file* 

GO  TO  30 

40  CALL  RESET 

CALL  DFILU(”TEHFILE",IER4)  ;  Delete  teaporary  file  holding 

;  top  5  choices* 

CALL  CHECK1ER4) 

TYPEMTEf!FILE  HAS  BEEN  DELETED.” 

TYPE”  " 

TYPE"0UT3  HAS  BEEN  CREATED.” 

STOP 

END 


to 


■ 
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PROGRAM  CHNGFILE 


File: 

CHNGFILE 

Language : 

Fortran  5 

Date: 

November 

1,  1982 

Author : 

J.  Fletcher 

Subject: 

Spectrum 

file  for  program  DRVR 

Calling  Sequence:  CHNGFILE 

ARGUMENT 

TYPE 

PURPOSE 

IFILE 

String 

identified  spectrum  file  to 
be  used 

INEWFIL 

String 

identifies  file  name  for 
altered  spectrum  file 

I HEADER 

Integer  Array 

holds  header  information  of 
spectrum  file 

ICOMP 

Integer 

number  of  frequency  components 
per  time  slice 

I HEADER  (56) 

Integer 

number  of  time  slices  in 
spectrum  file 

SPCHARY 

Real  Array 

holds  2  blocks  of  spectral 
components 

ILENVAL 

Integer 

number  of  time  slices  in 
spectrum  file;  user  can  alter 

I HEADER  (56)  for  using  SPENPLOT 

I LENGTH 

Integer 

time  slice  length  of  phonemes 

ARRAY1 

Real  Array 

altered  spectrum  of  speechfile 

PURPOSE: 

This  program  alters  the  original  spectrum  of  a  speech- 
file  that  has  had  a  64  point  DFT  performed  on  it.  This  allows 
DRVR  to  perform  2  to  4  vector  phoneme  recognition  on  a  speech 
file. 
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'  AD-A144  562  PERFORMANCE  IMPROVEMENTS  OF  THE  PHONEME  RECOGNITION 
ALGORITHMCU)  AIR  FORCE  INST  OF  TECH  WRIGHT-PflTTERSON 
AFB  OH  SCHOOL  OF  ENGINEERING  J  E  FLETCHER  JUN  84 
UNCLASSIFIED  AFIT/GE/EE/84J-2  F/G  17/2  NL 


MICROCOPY  RESOLUTION  TEST  CHART 

NATIONAL  BUREAU  OF  STANDARDS- 1963-A 


DESCRIPTION: 


Location:  FLETCHER.  DR 
ARGUMENT  STRUCTURE: 

IFILE  =  name  of  original  spectrum  file  created  by 

DRVR:  formatted  as  a  13  character  Hollerith 

string 

INEWFIL  =  name  of  spectrum  file  alter  by  CHNGFILE  in 
13  character  Hollerith  string  format 

ICOMP  =  integer  that  is  a  multiple  of  32 

ILENVAL  =  integer  value  equal  to  original  number  of 
time  slices  when  SPENPLOT  (ref  7)  is  to 
be  used 

ILENGTH  =  integer  value  of  2  or  4  depending  on  phoneme 
length  used  as  prototype 

PROGRAM  USE: 

This  program  is  to  be  used  only  on  a  spectrum  file  for¬ 
matted  as  the  spectrum  file  created  by  program  DRVR  (Ref  6) . 
In  this  spectrum  there  is  a  one  block  header  attached  to 
the  file  and  every  thirty-second  element  is  the  energy  of 
that  time  slice.  This  program  is  to  be  used  on  both  the 
spectrum  of  the  speechfile  and  the  phoneme  speechfile. 

FILES  CREATED: 

INEWFIL:  file  name  is  user  defined:  holds  altered 

spectrum  components  and  is  written  in  block 
format  with  a  one  block  header  attached  and 
each  thirty-second  component  is  a  constant 
1.0. 


oonoonno  on  n  o  o  o  no  non  o  o  rs  <■>  ooooonrtonooonnooopiooo 


•• 

PROGRAM:  C  H 

N  G  F  I  L  E 

•  » 

BY:  CPT  JANES 

E.  FLETCHER 

•» 

it 

DATE:  1  NOV 

B2 

«• 

•  » 

NOTE:  THIS  PROGRAM  IS  FOR  FORTRAN  5. 

Basically,  the  purpose  of  this  proqraa  is  to  alter  the  spectrua 
files  created  by  dartin' s  DRVR  proqraa  in  order  to  perfore  phoneee 
recognition  on  speech  files  using  different  length  phoneee  teaplates. 
Tms  prograe  perforas  tao  tasKs: 

1.  It  takes  the  spectrua  file  of  the  phoneae  teaplate  and 
sets  the  energy  value  to  the  constant  1.0,  this  is  the 
first  aord  in  each  group  of  32  aords  Baking  up  a  tiae  slice's 
frequency  coaponents.  it  a  so  changes  the  inforaation  in 

the  header  block  so  that  the  correct  nuaber  of  coaponents  are 
banged  against  each  otherti.e.  for  a  4  vector  phoneae,  a  group 
of  12B  coaponents  oust  be  banged  together).  The  other 
value  to  be  changed  in  the  header  block  is  the  nuaber  of 
vectors  in  the  nea  spectrua  file  (this  depends  on  the  nuaber 
of  vectors  per  phoneae). 

2.  It  takes  the  spectrua  file  of  the  speech  file  and  perforas 
the  saae  task  as  eith  a  phoneae  spectrua  file,  plus  it 
expands  the  spectrua  file.  This  expansion  is  based  on  the 
aiqontha  used  ahen  perforaing  distance  aeasunng  aith  a 
speech  file  and  a  phoneae  teaplate  set  as  in  Seelandt's 
prograa.  The  speech  file  is  divHed  into  groups  of  vectors 
ehere  the  nuaber  of  vectors  per  group  is  equal  to  the  nuaber 
of  vectors  per  phoneae  teaplate.  There  is  also  an  overlap 
aith  the  groups  such  that  the  beginning  vector  of  a  one 

group  of  vectors  aas  the  second  vector  of  the  last  group 
of  vectors. 

The  resulting  spectrua  files  can  then  be  run  in  DRVR,  aaking  DRVR 
perfora  phoneae  recognition  using  1,  2,  and  4  vector  phoneaes  as 
teaplates. 


1 
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DIMENSION  IHEADERI 25d) ,  SPCHARY(25d) ,  ARRAYK640) ,  I F I LE <  13) ,  INEUFILU) 
TYPE'IS  THE  FILE  TO  BE  MODIFIED  FROM" 

TYPB"A  SPEECH  FILE  OR  A  PHONEME  FILE?" 

ACCEPT" (TYPE  0  FOR  SPEECH  OR  1  FOR  PHONEME):  ",1ANS 
TYPE  -  " 

TYPE"ENTER  NAME  OF  SPECTRUM  FILE" 

1 F ( IANS  >E8«  1)  ACCEPT'TO  BE  USED  AS  A  PHONEME  FILE:  " 

IP ( IANS  .EQ.  0)  ACCEPT-TO  BE  USED  AS  AN  OBSERVATION  FILE:  " 

READ (11 ,5)  IFILE(l) 

S  FORMAT ( S 1 3 ) 
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CALL  OPEN< 1 ;1F1LE ,1  ,IER1 ) 

CALL  CHECXdERl) 

ACCEPT'ENTER  HEW  FILE  NAHE:  “  {  Fite  naee  for  nee  spectrue  file. 

READ (11 >5)  IHEUF1L ( 1 ) 

ACCEPT" ENTER  VECTOR  LENGTH  OF  PHONEMES:  ”,  ILENGTH 

CALL  F0PEN(2,INEWF1L)  {Opens  channel  to  nee  spectrue  file. 

ISTART  ®  1  ;  Beginning  block  of  freq.  coeponents  in  old  file 

ACCEPT'NO.  COMPONENTS/  VECTOR  IN  NEW  FILE:  “  , ICOHP 

1CONT  =  1  ;  For  counting  •  of  32  point  vectors. 

IBLK  *  1  {  Starting  block  •  for  nee  file. 


CALL  RDBLK(l,0,lH£ADEMrIER0)  ;  Read  header  block. 

CALL  CHECK! IERO) 

IHEADER ( 57 )  =  ICOHP  ;  Change  e  of  freq.  coeponents 
ICHECX  *  ( IHEADER ( 56 )  -  ( ILENGTH  -  1)1  •  1LENCTH  ;  Checks  for 
{correct  e  of  32  paint  vectors  in  nee  file. 

IF( IANS  .EQ.  1)  CO  TO  9 

IF < IANS  .EQ.  0)  IHEADER ( 56 )  -  IHEADER ( 56 )  -  (ILENGTH  -  l)  ;  l  of 

i  ICOHP  point  vectors  is  speech  spectrue 
IF(IAHS  .EQ.  1)  I  HEADER ( 56 )  *  IHEADER (56) /ILENGTH  ;e  of  ICOHP 

{  point  vectors  in  phoneee  spectrue. 

TYPE'FILE  LENGTH  (IN  VECTORS)1  “  ,IHEADER(56) 

ACCEPT"D0  YOU  WANT  TO  CHANGE  THIS  VALUE?(1=Y/0  =  N>"  ,IANSA  ;  Can  be 
IFdANSA  .EQ.O)  GO  TO  11  {  changed  for  plotting  spectrue  using 

{SPENPLOT  prograe  <eust  also  change  ICOHP  to  32). 

ACCEPT'ENTER  FILE  LENGTH  VALUE:  H,ILENVAL  ;Should  be  the  saee 
IHEADER(56)  *  1LENVAL  {  as  ICHECK. 

CALL  WRBLK(2,0, IHEADER, 1,IER4)  ;  Write  header  block  to  nee  spectrue. 
CALL  CHECK(1ER4) 


CALL  RDBLX( 1 ,ISTART,SPCHARY,2,IER2)  ;Read  2  blocks  of  spectral  coeponents. 
IF ( IER2  .EQ.  9)  GO  TO  55  {If  the  end  of  file;  end  the  prograe. 

IJHP  =  1  {To  increeent  for  each  32nd  freq.  coeponent. 

INDEX  •  0  ;  Index  in  the  old  spectrue  array. 

1NCR  *  0  {Index  in  the  nee  spectrue  array. 

DO  20  Id,B 

SPCHARY( IJHP)  *  1.0  {Change  energy  value  to  a  constant. 

IJHP  -  IJHP  *32  ;Go  to  next  energy  value. 

IFdANS  .EQ.  0)  GO  TO  25 

CALL  URBLX(2;IBLX;SPCHARY;2;IER3)  {For  phoneee  spectrue;  erite  blocks 
1F( 1ER3  .HE.  1)  TYPE” WRBLI  ERROR  -  ”  ,IER3  {  to  nee  file. 

IBLK  *  IBLK  ♦  2  {  Increeent  to  next  teo  blocks  for  reading. 

ISTART  *  ISTART  ♦  2  {Return  to  read  2  Bore  blocks. 

GO  TO  10 


DO  30  11*1,4  {Do  for  4  groups  of  vectors. 

DO  40  J *  1 # ILENGTH  {Do  for  •  of  vectors  per  phoneee 
DO  50  J  J  *  1 ;32  {create  one  group. 

ARRAYHJJ+INCR)  *  SPCHARY( JJ+INDEX)  {Put  freq.  coeponents 

{in  nee  array. 

1C0NT  *  1C0NT  ♦  1  {Count  •  of  vectors  created  in  nee  file. 
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IFdCOHT  *EQ.  ICHECX)  60  TO  55  \  ChccK  to  sake  surt  correct 

^amount  of  vectors  have  been  created* 

IMCR  *  IHCR  ♦  32  ;Jump  over  last  set  of  components* 

INDEX  =  INDEX  +  32  ;  Increment  to  next  32  components* 

CONTINUE 

INDEX  -  II  •  32  ;  Set  to  begin  reading  in  components  for  next 

*  group  of  vectors. 

CALL  Wmi(2,IBLX,AftRAYi,UEN6TH,IER3>  ;  Read  in  •  of  blocks 
IF( IER3  .HE.  1)  TYPE'HRBLX  ERROR  «  “>IER3  ?  created  by  DO  loops. 

IBLX  *  IBLX  ♦  ILENGTH  ;  Increment  to  next  set  of  blocks. 

ISTART  *  ISTAR?  ♦  1  ;  Increment  1  block  to  read  in  next  teo  blocks. 

GO  TO  10  ;  Start  over  again. 

CALL  RESET 
STOP 


PROGRAM  SGRAM 


Pile: 

SGRAM 

Language : 

Fortran 

5 

Date: 

October 

20,  1982 

Author: 

K.  Seelandt,  modified  by  J.  Fletcher 

Subject: 

voice  spectrograms 

Call  Sequence: 

SGRAM 

ARGUMENT 

TYPE 

PURPOSE 

SFREQ 

Real 

value  set  for  the  sampling  fre¬ 
quency 

IHEIGHT 

Integer 

height  of  printed  characters 
per  frequency  component 

IWIDTH 

Integer 

width  of  printed  character 
per  frequency  component 

IPRE 

Integer 

width  for  performing  preemphasis 

IDB 

Integer 

number  of  dB  to  preemphasis 

FREQ 

Real 

starting  frequency  of  preemphasis 

ISIZE 

Integer 

number  of  samples  per  DFT 

ENTHRES 

Real 

energy  threshold  for  normal¬ 
ization 

IFILENAM 

String 

name  of  the  speechfile  to  be 
used 

I START 

Integer 

First  block  in  speechfile  to  be 
used  in  spectrogram 

ILAST 

Integer 

last  block  in  speechfile  to  be 
used  in  spectrogram 

IN 

String 

title  of  spectrogram 

RENERGY 

Real  Array 

holds  energy  for  each  spectral 
component  in  sentence 

DTARAY  Real  Array  holds  frequency  components  to 

be  used  in  DFT 

IDSP  Integer  Array  holds  a  block  of  components 

read  from  speechfile 

B  Real  Array  holds  windowed  frequency  com¬ 

ponents 

IBI  Integer  Array  holds  integer  values  of  fre¬ 

quency  components  after  DFT 
is  performed 

PURPOSE: 

This  program  is  used  to  plot  a  spectrogram  of  a  speech- 
file  using  a  64  point  DFT  and  either  Hamming  a  Rectangular 
window. 

DESCRIPTION: 

Location:  FLETCHER.  DR 

ARGUMENT  STRUCTURE: 

I HEIGHT  *  1  to  plot  normal  size  spectrogram 

IWIDTH  =  1  to  plot  normal  size  spectrogram 

IFILENAM  =  desired  speechfile  in  a  13  character 
Hollerith  string  format 

ISTART  =  0  to  start  at  beginning  of  speechfile 

ILAST  =  usually  last  block  of  the  speechfile 

must  not  exceed  file  block  size 

ISIZE  =  64  for  64  point  DFT 

ENTRES  =  value  can  be  set  by  user;  10  was  used 

exclusively  in  this  thesis 

IN  =  79  character  Hollerith  string  format; 

selected  by  user 

PROGRAM  USE: 

This  program  can  only  be  used  with  the  Printronix 
printer.  The  speechfile  used  should  be  in  block  format 


no  restrictions  on  size.  Be  sure  speechfile  is  digitized 
speech  and  not  a  spectrum  file.  The  loading  is  as  follows: 

RLDR  SGRAM  BYTEOUT  EDPT.LB  @FLIB@ 

BYTEOUT . RB  is  located  on  Fletcher's  tape  #1. 

FILES  CREATED: 

TENERGY:  contains  component  numbers  and  the  total 

energy  in  the  speechfile  for  each  spec¬ 
tral  component 

IENERGY :  contains  the  vector  number  and  the  total 

energy  for  each  vector 

DCOMPS:  contains  the  DFT  frequency  components  of 

the  vectors  each  vector  is  stored  as  a  record 
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PROGRAM:  S  G  R  A  H 


DATE:  20  OCT  02 

BY:  LT.  IARL  SEELANDT 
REVISED  BY:  CAPT  JAKES  E.  FLETCHER 
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COMPILE  AS  A  FORTRAN  V  PROGRAM 

«««••  This  program  eat  developed  by  LT*  HARK  FELKEY  during 
his  thesis  at  AFIT*  This  program  creates  a  spectrogram  of 
a  speech  file  by  finding  the  DFT  of  a  set  of  digitized 
speech  samples*  then  preeephasizing  the  high  frequency 
components  of  the  resultant  frequency  vector  (to  obtain 
spectrograms  that  look  as  ouch  as  possible  like  the  one's 
obtained  by  Potter*  Kopp  and  Green  in  Visible  Speech)* 

The  background  noise  is  effectively  reduced  by  threshold* 
ing  the  energy  of  eaci  frequency  vector  (total  energy  is 
equal  to:  square-root  of  the  sua  of  the  squares  of  individual 
frequency  coaponents).  This  threshold  value  is  obtained  by 
adjusting  the  value  to  remove  as  much  uneanted  signal  as 
is  possible*  while  not  degrading  the  recognition  results* 

Finally*  the  spectrogram  is  printed  out  on  the  printronix 
printer  using  a  grayscale  plotting  technique* 

This  program  has  been  modified  to  use  either  Hamming  windoming 
or  rectangular  eindoeing* 

INITIALIZE  VARIABLES 

DIMENSION  DTARAY(256) * ISYHB0L1 ( 10 ) *  I SYHB0L6 ( 10) *  I  DATE ( 3  > 

DIMENSION  B(256)  *IBI<120)  *  I  DSP  <  256 )  *RENERGYU4) 

DIMENSION  IFILENAH( 13) *lH(79) 

DIMENSION  ISYHB0L2( 10) * ISYMB0L3 ( 10 ) *ISYMB0L4( 10) *  I SYMBOLS! 10) 
C0MH0H/BLK/ 1SYHB0L1 *  I S Y MB0L2 *  I SYMBOL 3 , ISYHBUL4*ISYHB0L5  *ISYHB0L6 
COMPLEX  DTARAY 

CRITICAL  VARIABLES  DEFINED.  SOME  OF  THESE  HAY  BE  OPERATOR  ADJUSTED  BY 

OPTIONALLY  COMPILING  THE  PROGRAM  WHICH  COMPILES  THE  STATEMENTS  PRECEEDED  BY 

AN  'X'  IN  COLUMN  1  —  (USE  FORT/X  SCRAM). 

I SAMP  *  1  {COUNTS  THE  NUMBER  OF  VECTORS  CREATED* 

IC0UNT*1  {VARIABLE  THAT  KEEPS  TRACK  OF  WHEN  10  VECTORS  WERE  OUTPUT. 
ICHAN«3  {I/O  CHANNEL  FOR  SCRAM  OUTPUT  (USED  IN  SUBROUTINE  CALL) 
SFREQ'BOOO.O  {SAMPLING  FREQUENCY 
I HEIGHT » 1  {HEIGHT  OF  SCRAM  CHARACTERS 
IHIDTH«1  {WIDTH  OF  SGRAN  CHARACTERS 
IPRB • 1  {SIGNAL  TO  DO  PREEMPHASIS 
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IDB*6  (NUMBER  OF  BBS  TO  PREEHPHASIZE 
FREQ*500.0 (STARTING  FREE  FOR  PREERPHASIS 
IPRESS>2  {SIGNAL  TO  NOT  COMPRESS  VECTORS 
ISIZE*64  (NUMBER  OF  SAMPLES  PER  BFT 
EHTHRES*90.0  (ENERGY  THRESHOLD  FOR  NORMALIZATION 

PLOTTER  CHARACTERS  USED  BY  THE  PRINTROHIX  PLOTTER  FUNCTION. 

IPL0T*005X  (PLOT  COMMAND 

ILF*012K  (PRINT  LINE  OF  DATA  JUST  SENT. 

IDASH*177K  (DASH  USED  FOR  SCALE  OH  SGRAH 

IBLANX-0  (EMPTY  CHARACTER  USED  TO  HAKE  SURE  COMPLETE  WORD  IS  OUTPUT 

SPECTROGRAM  SYMBOLS  THAT  MAKE  UP  THE  GRAY-SCALE  CHARACTERS.  EACH  SYMBOL 
DEFINES  1  OF  6  DOT  LINES  FOR  10  LEVELS  OF  INTENSITY. 

DATA  I SYHB0L1/ 1001, 1001, 1001, 1221, 1221, 1221, 1221, 1661, 1771, 1771/ 

DATA  ISYHB0L2/100K, 1221, 1661, 1661, 1771, 1771, 1771, 1771, 1771, 1771/ 

DATA  ISYHB0L3/100K,100K,100K,100K,10OK,122X,133K,133K,133X,177K/ 

DATA  ISYHB0L4/100K,100K,100X,122X,122X,122K,122K,166X,177K,177K/ 

DATA  I SYMBOLS/ 100K,122K  .166K,166K, 1771,1771, 177K , 177K,177K,177K/ 

DATA  ISYHB0L6/100X, 1001,1001, 1001, 100K,122K, 1331,1331, 1331,1771/ 

INPUT  CONTROL  VARIABLES,  OPEN  FILES,  4  PRINT  HEADING  OH  SCRAM. 

CALL  F0PEN<1,"TEHERGY">  (FILE  TO  INDICATE  RELATIVE  TOTAL 

(ENERGY  OF  SENTENCE. 

CALL  F0PEH<4,"IENERCY")  (FILE  FOR  INDIVIDUAL  ENERGY  OF  EACH  VECTOR. 

INPUT  NAME  OF  FILE  THAT  THE  SGRAH  HILL  BE  HADE  FROH-FILENAN 

ACCEPT'ENTER  FILENAME  TO  BE  SGRAH'  ED  :  " 

READ! 11 ,669) IFILENAHl 1 ) 

669  F0RNAT(S13) 


CALL  0PEN(5,IFILENAM,1,512,IER>  (FILE  THAT  CONTAINS  DIG.  SPEECH 
IF ( IER . HE • 1 )  TYPE  "ERROR  ON  OPEN,IER*" ,IER 
(CHECK  TO  SEE  IF  SGRAM  IS  DESIRED. 

ACCEPT  "CONSTRUCT  SPECTROGRAM?  < l=YES/2-H0) ,ISPEC 
IF  ( I SPEC • NE • 1 >  GOTO  591 

ACCEPT  "SEND  SGRAH  TO  PRINTER?  (YS1,N*2) :“,IREPLY 
IF  (IREPLY.NE.l)  GO  TO  5B0 
CALL  F0PEN(3,"4LPT“) 

CO  TO  590 
( 

510  CALL  F0PEN(3, "SYMBOLS", "B")  (IF  SGRAH  ISN'T  SENT  TO  PRINTER, STORE 

(CHARACTERS  IN  THIS  FILE  FOR  OUTPUT  LATER. 

590  CONTINUE 

X  ACCEPT  "HEIGHT  OF  SYMBOLS  OF  SGRAH?", IHEIGHT 

X  ACCEPT  "HIDTN  OF  SYMBOLS  OF  SGRAH?" ,IUIDTH 

591  ACCEPT  "FIRST  BLOC!  TO  READ*" ,ISTART 

ACCEPT  "LAST  BLOCK  TO  READ*", I LAST 
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ACCEPT  "NUMBER  OF  SAMPLES  PER  VECTOR  (PONER  OF  2>*",ISIZE 
ACCEPT"BO  YOU  WANT  A  HAMMING  yiMDOH?U*Y/0*N)  ",IHAM 
ISIZEH'ISIZE/2  {HUMBER  OF  COMPONENTS  PER  VECTOR 
OPEN  2,"DC0HPS‘,LEN*1SIZE  ;  STORES  THE  FREQ.  COMPS.  OF  EACH  VECTOR 
ACCEPT  "ENERCY  THRESHOLD *",ENTHRES 

IBYTE*999  {INITIALIZE  SUBROUTINE  BYTEOUT  WITH  THIS  VARIABLE 
CALL  BYTEOUT < ICHAN , IUYTE ) 

IF  (ISPEC.NE.l)  GOTO  599 
• 

{PRINT  HEADING  ON  SPECTROGRAM. 

; 

ACCEPT" ENTER  TITLE  OF  SGRAH  (79  CHAR.  OR  LESSX15)  “ 

RE AO (11 / 69 ) IN  < 1 ) 

F0RHAKS79) 

WRITE ( 3,70)18(1) 

FQRMAT(1X,“  “//“  mom****"  ,S79,  "•«*»»*#»«»  ”//“  “> 

CALL  DATE(IDATE,IER) 

IFC IER.NE.1)  TYPE  “ERROR  ON  DATE,IER*“ ,IER 
CALL  FGT1HE(IHQUR,IHIH,ISEC) 

WRITE ( 3 /5V2 )  (IDATEt I) ,1*1,3) 

F  ORNAT (1 X ,  ” DATE 213,15) 

WRITE ( 3 ^ 593 >  IH0UR,IM1N,ISEC 
FORMAT (IX; "TIME:" ,313) 

URITE(3;S95)  I S I ZE , I START , I LAST 

FORMAT ( IX , "HO ,  OF  SAMPLES*", 13,"  FIRST  BLOCK*", 13,"  LAST  BLOCK*", 
13) 

IF(IPRE.EQ.l)  WRITE (3 ,596 )  IDB 

F0RHAT(1X,’HIGH  FREQUENCIES  PREEMPHASIZED ,  DB/OCTAVE*" ,13) 

IF(IPRE.EQ.l)  VRITE(3,597)  FREQ 

FORMAK IX, "PREEMPHASIS  STARTS  AT  FREQ:”,F5.0) 

WRITE (3 >60 1 )  ENTHRES 

FORHATt IX, "ENERCY  THRESHOLD*”, F6. 1) 


JOE- 1 

WRITE(3,769) JOE 
FORMAT ("  “//" 


rll/fMHHHHIfmHIH 
!*»»»«  "//"  ") 


{THIS  SECTION  OF  THE  PROGRAM  READS  IN  DIG. SPEECH  AND  CREATES  SCRAN. 
\ 

1 

CONTINUE 

DO  60S  1*1, ISIZEH 

RENERGY(I)*0.0  {CLEAR  ARRAY  THAT  ADDS  UP  ENERCY  OF  SENTENCE 
1 

{MAIN  LOOP  OF  PROGRAM. 

\ 

DO  1000  1H*ISTART,ILAST 
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{INPUT  SPEECH  AND  PROCESS  IT. 

CALL  RDBLK(5,IM,IDSP,1 ,IER) 

IF(IER.ME.l)  TYPE  "ERROR  OH  RDBLI,IER*",IER 
J1>0 

400  CONTINUE 

{FORM  COMPLEX  ARRAY  THAT  HILL  BE  USED  FOR  DFT. 

DO  410  J -1 f IS1ZE 
B( J)*IDSP( JtJl) 

UNDO  *  1.0  {Defaults  to  rectangular  eindoe 

I F ( I HAH  .EQ.  1)  HND0*0. 54 -0.44*COS<  2. 0»3. 14 15924534* 

/  (J-ll/ISIZE)  {Equation  for  Hanning  eindoe 

B< J)  *  UNDO  *  B(J) 

DTARAY( J)*B( J) 

410  CONTINUE 

{DO  DFT,  LARAY*»  OF  POINTS,  INVER  SPECIFIES  FORHARD  TRANSFORM 

LARAY-ISIZE 

INVERT 

CALL  DFT5(DTARAY,LARAY, INVER) 

{FIND  MAGNITUDE  OF  DFT  RESULTS  TO  GET  FREQ.  COMPONENTS. 

DO  425  1  =  1 , ISIZEH 
B  ( I ) *  CABS ( DTARAY ( I ) /LARAY ) 

I F < I HAH  .EQ.  1)  B<I)-B(I>/0«54 
425  CONTINUE 

PREEHPHASIZE  HIGH  FREQUENCY  COMPONENTS. 

IF  <IPRE.NE.11  GO  TO  45 

{FIND  1ST  COMPONENT  TO  START  PREEMPHASIS  AT. 

IFREQ1=IFIX(FREQ/SFREQ«ISIZE>+1 

DO  700  I=IFREQ1 , ISIZEH 

{CONVERT  INTECER  VARIABLES  TO  REALS. 

RI*I 

RIFREQ1-IFREQ1 
RIDBa IDB 

{MODIFY  MAGNITUDE  OF  COMPONENTS. 

B(I)-B<I)*(10»*( RIDB/20*ALOG 104RI/RIFREQ1) /AL0G10 ( 2.01) ) 

700  CONTINUE 

FREQUENCY  VECTOR  COMPRESSED  TO  14  CHANNELS  IF  COMPRESSION  HAS  SELECTED. 

45  IF  (IPKESS.NE.il  GO  TO  30 

{FIND  NUMBER  OF  COMPONENTS  TO  COMBINE  INTO  1  COMPONENT 

JNUM*ISIZEH/14 

JJ1*1 

JJ2*JHUM 

DO  20  1*1,14 

SUM*0.0 

DO  10  J=JJ1  ,JJ2 
10  SUM*SUH+B< J) 

B< I ) *SUH 
JJl*JJltJNUM 
JJ2*JJ2+JHUH 
20  CONTINUE 

C 


n  n  o  o  o  o 


C  NORMALIZATION  OF  ENERGY  IN  VECTORS. 

C 

30  IF  (IFRESS.EQ.l)  ISIZEH=16 

SUHE--0.0 

DO  33  J*1 , ISIZEH 
SUHE*SUHE+(B(J)»«2) 

33  CONTINUE 
ENERCY*SQRT(SUME) 

URITE(4)  ISAMPP, ENERGY  {SAVE  INDIVIDUAL  ENERGYS  OF  VECTORS. 
ISAMPP* ISAMP+1 

IF  (ENERGY. CT.ENTHRES)  GO  TO  32 
{INCREASE  VALUE  OF  ENERGY  TO  ATTENUATE  COMPONENTS 
ENERGY*5»ENTHRES 
32  CONTINUE 

IMULT*32747  {RANGE  OF  CONPONENTS  AFTER  NORMALIZATION. 

DO  34  J*l, ISIZEH 
B ( J)*(B(J) /ENERGY) *IHULT 

{  SAVE  TOTAL  ENERGY  FOR  EACH  COMPONENT  FOR  THIS  UTTERANCE. 

RENERCYl J)*RENERGY( J)+B(J)  {SUM  UP  TOTAL  ENERGY 

34  CONTINUE 

ARRAY  VALUES  CONVERTED  TO  INTEGER  FORM 

DO  240  JJ  =  1, ISIZEH 
IBI( JJ)=IFIX(B< JJ)) 

240  CONTINUE 

% 

{SAVE  VECTOR  COMPONENTS  FOR  USE  LATER  BY  ' PPGEN' 

5 

CALL  URITR(2,ISAHP,IBI,1,IER) 

IF(IER.NE.l)  TYPE  "ERROR  ON  HRITR,IER*" ,IER 
• 

{IF  SGRAN  IS  NOT  DESIRED, SUP  OVER  SECTION  TO  SCALE  THE  CONPONENTS 

{TO  MAX.  VALUES  OF  10  AND  THEN  OUTPUT  THE  SGRAN  CHARACTERS. 

• 

IF  (ISPEC.NE.l)  GOTO  330 
DO  245  1=1 ,ISIZEH 

IBI(I)=IFIX(FL0AT(IBI(I))/32747*10)+1 

{MAKE  SURE  COMPONENT  VALUES  ARE  IN  CORRECT  RANGE  (1-10). 

IF  (IBI(I).LE.O)  IBI ( I )  =  1 
IFdBI(I)  .GT.10)  I B 1 1 1 )  =  1 0 
245  CONTINUE 

DETERMINE  AND  OUTPUT  THE  CHARACTERS  THAT  HILL  CONSTRUCT  THE  SPECTROGRAM 

DO  320  1*1 ,INIDTH  {REPEAT  SYMBOLS  IHIDTH  NUMBER  OF  TIMES. 

CALL  BYTEOUT ( ICHAN , IPLOT )  {SEND  PLOT  COMMAND  TO  PRINTRONIX 
{SEND  1ST  DOT  RON  FOR  ALL  THE  COMPONENTS 
DO  250  JJ*1,ISIZEH 
JS*IBI( JJ) 

DO  250  J*1 ,IHEI6HT 

CALL  BYTEOUT (ICHAN , ISYMB0L1 ( JS) ) 

250  CONTINUE 

CALL  BYTEOUT(ICHAN,ILF)  {SEND  LINE  FEED  TO  TERMINATE  LINE 


o  o  o 


l> 


CALL  BYTEOUTdCHAN,  IPLQT) 

;SEHI  2ND  SOT  ROM  FOR  ALL  THE  COMPONENTS 

SO  260  JJ*1 jISIZEH 

JS»IBI(JJ) 

SO  260  Jal ,IHE1GHT 

CALL  BYTE0UT(ICHAN,ISYMB0L2( JS) ) 

260  CONTINUE 

CALL  BYTEOUTdCHAN, ILF) 

CALL  BYTEOUTdCHAN,  1PL0T) 

; SE H D  3RD  DOT  ROM  FOR  ALL  THE  COMPONENTS 
DO  270  J J s 1 ,ISIZEH 
JS=IBI( JJ) 

DO  270  Jal,IHEIGHT 

CALL  BYTEOUTdCHAN, ISYMB0L3(JS>) 

270  CONTINUE 

CALL  BYTEOUTdCHAN, ILF) 

CALL  BYTEOUT ( ICHAN , 1PL0T ) 

; S END  4TH  DOT  ROM  FOR  ALL  THE  COMPONENTS 

DO  2B0  J J  =  1 , ISIZEH 

JS*IBI(JJ> 

DO  2S0  Jd,lHEICHT 

CALL  BYTE0UT(ICHAN,1SYMB0L4( JS) ) 

260  CONTINUE 

CALL  BYTEOUT (ICHAN >ILF) 

CALL  BYTEOUTdCHAN, IPLOT) 

;SEND  5TH  DOT  ROM  FOR  ALL  THE  COMPONENTS 

DO  290  J  J  =  1 , ISIZEH 

JS-IBI(JJ) 

DO  290  Jd/IHEIGHT 

CALL  BYTE0UT(ICHAN,ISYMB0L5( JS) ) 

290  CONTINUE 

CALL  BYTEOUTdCHAN, ILF) 

CALL  BYTEOUTdCHAN, IPLOT) 

;SEND  6TH  DOT  ROM  FOR  ALL  THE  COMPONENTS 

DO  300  JJ*l, ISIZEH 

JSdBKJJ) 

DO  300  J*1  ,IHEICHT 

CALL  BYTEOUTdCHAN, I S YHB0L6 < JS > > 

300  CONTINUE 

;PRINT  A  DASH  AFTER  EVERY  10  VECTORS 
IF  (IC0UNT.NE.10)  GOTO  310 
CALL  BYTEOUTdCHAN, IDASH) 

ICOUNT*0 

310  CALL  BYTEOUTdCHAN, ILF) 

320  CONTINUE 

CALL  BYTEOUTdCHAN, IBLANE);SEND  A  BLANI  TO  INSURE  LAST  CHARACTER  SENT 
330  IC0UNT*1C0UNT*1 

^RESTORE  ISIZEH  INCASE  IT  HAS  CHANCED  FOR  COMPRESSION 
lSIZEH’ISIZE/2 

END  OF  SCRAM  CONSTRUCTION 
ISAHP‘ISAMP+1 

;CHECK  TO  SEE  IF  ANOTHER  VECTOR  CAN  BE  FORMED  FROM  THIS  BLOCK  OF  SPEECH 
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J1«JHISIZE 

IP  (J1.LT.236)  CO  TO  400 

CONTINUE 

• 

•'end  OF  MAIN  LOOP  OF  PROCRAIt 
• 

DO  1010  I s 1 ^ ISIZ1H 

WRITE ( 1 )  I »RENERCt (1)  {SAVE  TOTAL  ENERGY  ARRAY 
CALL  RESET 

URITE(10r<7>V<7>V<7>"  CRIHC  BELL  TO  UAKE  UP  OPERATOR 


PROGRAM  TEKTALK 


File: 

TEKTALK 

Language : 

Fortran 

5 

Date: 

October 

20/  1982 

Author : 

K.  Seelandt,  modified  by  J.  Fletcher 

Subject: 

Digitized  speech  output 

Calling  Sequence:  TEKTALK 

ARGUMENT 

TYPE 

PURPOSE 

SFREQ 

Real 

Value  set  for  the  sampling 
frequency 

IDB 

Integer 

Number  of  DB  to  preemphasize 

FREQ 

Real 

Starting  frequency  of  pre¬ 
emphasize 

ISIZE 

Integer 

DFT  sampling  size 

ENTHRES 

Real 

energy  threshold  for  normal¬ 
ization 

PARAM  (1)  & 
FILE  NAM 

String 

name  of  desired  speechfile 

TITLE 

String 

Title  for  spectrogram 

I START 

Integer 

First  block  in  speechfile  to 
be  used  in  spectrogram 

ILAST 

Integer 

Last  block  in  speechfile  to 
be  used  in  spectrogram 

DTARAY 

Real  Array 

Holds  frequency  components  to 
be  used  in  DFT 

RENERGY 

Real  Array 

Holds  energy  for  each  spectral 
component  in  the  sentence 

B 

Real  Array 

Holds  windowed  frequency  com¬ 
ponents 
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IBI 

Integer 

Holds  integer  values  of  frequency 
components  after  DPT  is  performed 

IDSP 

Integer  Array 

Holds  a  block  of  components  read 
from  speechfile 

ICONTROL 

Integer  Array 

Holds  screen  command  for  TEK- 
tronix  1040  terminal 

IH 

Integer  Array 

Holds  position  of  cross  hairs 

IL 

Integer  Array 

Holds  position  of  cross  hairs 

IMASK 

Integer  Array 

Holds  constants  used  for 

ANDing  and  ORing 

PARAM 

(8) 

Real 

Starting  position  to  heard  in 
speechfile 

PARAM 

(9) 

Real 

Ending  position  to  heard  in 
speechfile 

PARAM 

CIO) 

Real 

Number  of  times  segement  to 
be  heard 

PURPOSE: 

This  program  is  used  to  plot  the  spectrogram  and  allow 
the  user  to  listen  to  any  part  of  the  speechfile  he  wants, 
as  many  times  as  he  wants. 

DESCRIPTION 

Location:  FLETCHER.  DR 
ARGUMENT  STRUCTURE: 

PARAM  Cl)&  =  name  of  speechfile  in  13  character 
FILENAM  Hollerith  string  format 

TITLE  =  title  for  the  spectrogram  in  79  character 

Hollerith  string  format 

ISTART  =  starting  block  of  speechfile  to  be  shown 

using  spectrogram 

I LAST  =  last  block  of  speechfile  for  spectrogram; 

total  number  of  blocks  must  not  exceed  20 

IDB  =  6dB  was  used  exclusively  in  this  thesis 


FREQ  *  500.0  was  used  exclusively 

ISIZE  =  number  of  samples  for  each  time  slice , 

64  was  used  exclusively 

ENTHRES  =  threshold  for  energy  the  value  10  was  used 

PROGRAM  USE: 

This  program  can  be  used  only  on  the  Tedtronix  4010 
terminal  using  the  Data  General  A/D/A  converter.  The  A/D/A 
converter  must  be  set  up  prior  to  the  running  of  this  program 
as  per  instructions  in  the  thesis  text.  The  speech  output  is 
through  the  Rockland  filter  and  Crown  amplifier.  The  user  can 
select  any  segement  of  speech  he  wants  to  listen  to  by 
displaying  up  to  20  blocks  of  speech  on  the  terminal  and  using 
the  cross  hairs  to  delimit  the  segment  of  speech  he  wants  to 
hear  and  the  number  of  times  he  wants  it  repeated.  This  program 
is  especially  useful  for  listening  to  speech  segments  to 
pick  phoneme  prototypes.  To  load  this  program,  use  the 
following  command: 

RLDR  TEKTALK  BYTEPAC  TEKDOT  TEKLINE  TEKTONE 

EDFT.LB  @FLIB@ 

Also,  make  sure  TALK . SV  is  in  the  same  directory  since  this 
is  swapped  with  TEKTALK  to  output  the  speech. 

FILES  CREATED: 

TENERGY:  contains  component  number  and  the  total 

energy  in  the  speechfile  for  each  spectral 
component 

IENERGY :  contains  the  vector  number  and  the  total 

energy  for  each  vector 
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DCOMPS:  contains  the  DFT  frequency  components  of  the 

vectors;  each  vector  is  stored  as  a  record 

PARAM.AO:  contains  file  name  of  speech,  starting  and 

ending  work  of  segment  of  speech  to  be  heard 
and  number  of  times  to  repeat  it;  data  writ¬ 
ten  sequentially  in  the  file  using  WRSEQ 


c 

HOI 

taaaa 

■< 

c 

aaeei 

mean 

c 

an 

aa 

- - . 

c 

as 

PROGRAM:  T  E  1  T  A  L  K 

aa 

c 

aa 

aa 

c 

aa 

WRITTEN  BY:  LT  XARL  SEELANDT 

aa 

c 

aa 

aa 

aa 

REVISED  BY:  CAPT  JANES  E.  FLETCHER 

aa 

aa 

** 

^ _ _ 

c 

aa 

DATE:  12  OCT.  ‘82 

aa 

c 

aa 

aa 

c 

aaaai 

laaaa 

c 

aaaai 

taaaa 

c 

c 

Load 

TEKTONE,  TEKDOT;  TEKLINE,  BYTEPAC  and  EDFT.LB 

-  • 

C  along  aith  this  program  using  the  RLDR  connand. 

C 

C  NOTE:  THIS  ROUTINE  IS  FOR  FORTRAN  5  !! 

C 

The  spectrograa  portion  of  this  prograa  was  derived  froa 
C  a  prograa  aritten  by  Lt.  HarK  FelKey  ehile  aorking  on  his 

C  thesis  here  at  AFIT, 

C 

C»*»»»»  This  prograa  ai 1 1  accept  as  an  input;  a  file  containing 
C  froa  1  to  AS  blocks  of  digitized  speech.  The  output  ail) 

C  be  a  spectrograa  of  a  portion  of  the  file,  aith  a  length  up  to 

C  SO  vectors;  or  0.64  seconds  of  speech  (aith  a  64  saaple  OFT), 

C  Again  if  the  64  saaple  DFT  is  used  the  frequency  spectrue  of 

C  the  spectrograa  consists  of  32  separate  levels  froa  DC  to  4V.Hz, 

C  When  this  prograa  uses  the  64  saaple  DFT  the  vectors  have  a 

C  length  of  Sasec  since  the  saapling  rate  of  the  analog  speech 

C  signal  is  SKHz,  The  aaxiaua  length  of  a  speechfile  is 

C  currently  SS  blocks  /  2252S  eords,  this  equates  to  a  aaxiaua 

C  spectrograa  length  of  352  vectors  (  8  22528  /  64  ), 

C 

After  taking  the  DFT  of  the  digitized  speech  the  high 
C  frequency  coaponents  of  each  tiae  vector  are  preeaphasized ; 

C  the  coaponents  of  the  vectors  are  then  energy  thresholded 

C  and  finally  they  are  energy  noraalized  on  a  scale 

C  of  0  thru  9  for  gray-scale  encoding.  After  all  this  has 

C  been  done  the  spectrograa  is  displayed  on  the  Tektronix 

C  4010  Graphics  Teraina),  Once  the  spectrograa  is  coapleted 

C  the  teraina)  is  left  in  the  GIH-aode  and  ail)  display  the 

C  cursors.  The  cursors  can  be  adjusted  by  the  operator  to 

C  input  too  X-values  froa  the  screen  to  the  coaputer;  they 

C  deteraine  ahat  portion  of  the  spectrograa  ail  I  be  heard  ahen 

C  using  the  audio  output  prograa,  This  alloes  the  operator  to 

C  listen  to  any  segaent  of  speech  displayed  on  the  spectrograa 

C  enabling  hia/her  to  deteraine  an  optiaal  set  of  prototype 

C  phoneaes  to  be  used  for  the  recognition  of  phoneaes  in 

C  connected  speech, 

C 

C 

Caeaeee  NOTE:  In  addition  to  the  above  listed  inforaation 
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this  routine  ■ill  also  illoi  the  user  to  choose  the 
of  mindoming  used  i«  creatinq  the  spectrogram.  As  the 
operator  requests)  the  eindoa  eiM  be  a  rectanqular  non* 
overlapping  one  or  a  Hanainq  non-over  1 appinqon*. 

The  eindoe  multiplier  for  a  rectanqular  eindoe  is  1* 

The  eindoe  eultiplier  used  for  the  Hamming  eindoe  is 
the  saee  as  that  eentioned  by  Oppenhiee  l  Shafer  in 
Digital  Signal  Processing)  on  page  242. 

Also  note  that  ehen  using  other  than  a 
rectanqular  eindoe  the  frequency  components  should 
be  multiplied  be  some  multiplication  factor  in 
order  to  Keep  the  relative  magnitudes  the  same. 

For  a  Hamming  eindoe  this  factor  is  1/0.54  and  is 
done  right  after  the  call  to  the  DFT  routine. 


The  number  of  tines  a  segment  of  speech  is  heard  can  be 
operator  adjusted  from  the  TeKtronix  terminal  by  the  Keys 
hit  to  send  the  cursor  information  about  the  final  position 
of  the  teo  positions  sent  to  hear  a  segment.  For  example 
hitting  a  1  then  9  mill  alloe  the  user  to  hear  the  speech 
sequent  19  times;  then  the  cursors  are  displayed  again  and 
the  process  can  be  continued. 


******  Set  up  variables  to  be  used. 

DIHENSIOH  IH(2>  ;IL(2)  ;IM0RD1(2) ; I U0RD2 ( 2  > ,I(3),INASK(3) 
DIMENSION  DTARAY(25i)  ,IDATE<3) ,B(256) ,IBI ( 12b) ,IDSP(25«) 
DIMENSION  I  CONTROL! 7) ,RENERCY(64) ; I  STORE ( 13) 

INTEGER  FILENAM<13),TITLE(7?>,PARAM110) 

DATA  INASK/37K)17400K)17K/ 

DATA  IC0NTR0L/033X ,0141 ,0321 ,005K ,035K ,0371 ,0071/ 

•«  ASCII  equivalent?  ESC  FF  SUB  EMQ  GS  US  BEL. 

COMPLEX  DTARAY 
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C 


Define  critical  variables  used  in  the  proqrae,  soee 
variables  can  be  user-defined  by  optionally  compiling 


statements  preceeded  by  an  ' 

I F  L AC  -  0 
ISAHPsO 
I PRE* 1 
IDB*4 

SFREQ*6000.0 

FREQ*500.0 

ISIZE*44 

IHAM«0 

qaill 


using  the  FORT/X  command. 

?mhat  job  is  next. 

^counts  vectors  created, 
^signal  to  preemphasize. 
?DB‘s  to  preemphasize  by. 
^sampling  frequency 
;start  preemphasis  here. 
gDPT  sample  size. 

;mhat  type  of  mindoming 
be  done  default  *  rectang. 


non  on  o  n  o 


ISYTE*P?9  fini t •  output  routine. 

CALL  lYTEPAC(lBYTE)  \ 

A  CALL  RESET 

•onn  Input  control  variables  and  print  a  heading. 

OPEN  1;"TENERCY“  {contains  total  energy  of 

tspeechfile  at  each  frog. 

OPEN  4;"IEHERGY"  contains  energy  of 

;each  individual  vector. 

•*•••  Hake  available  none  basic  instructions  for  the  user  and 
input  the  naae  of  the  speechfile  to  be  used. 
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TYPE“<15>  < 15>  " 

TYPE"  HELCOHE  TO  TEKTALK" 

TYPE"  mmimm  »»«  imHiimn  <15>" 

TYPE"  developed  by: 

TYPE"  LT  EARL  G.  SEELANDT 

TYPE"  revised  by:  " 

TYPE"  CAPT  JANES  E.  FLETCHER  <15X151  " 

ACCEPT"90  YOU  WANT  TO  CONTINUE/CET  HORE  INFO 
/  ;  ENTER  1/0  ",IC0HT 

C 

IF ( ICONT . F" . 1 ) GO  TO  S 
CALL  BYTEPACdCONTROLU)) 

CALL  BYTEP AC ( I COHTROL ( 2 ) ) 

CALL  FDELAY ( 10) 

C 

TYPE"  This  routine  eas  developed  as  an" 

TYPE"  interactive  routine  to  be  used  as  a  tool" 

TYPE"  that  nould  enable  a  user  to  nork  in  the" 

TYPE"  area  of  prototype  phonene  devetopenent." 

TYPE"  TEKTALE  alloes  a  user  to  simultaneously" 

TYPE"  see  a  picture  of  speech  (a  spectrograo)" 

TYPE"  ehile  hearing  different  segments  of  the" 

TYPE"  sane.  <15>  <15> 

TYPE"  For  this  program  to  eork  lake  sure" 

TYPE"  the  fo) toning  rules  are  follooed:  " 

TYPE"  1.... TALE. SV  has  been  conpiled  and  loaded" 

TYPE"  in  your  directory.  " 

TYPE"  2«...Hake  sure  DP4F:SAH  has  been  initialized" 
TYPE”  3.. ..The  proper  hardeare  connections  have" 

TYPE"  been  made  to  the  A/D/A  converter." 

TYPE"  4.... Hake  sure  the  cables  have  been  snitched" 
TYPE"  in  the  back  of  the  Croon  amplifier  and" 

TYPE"  Rockland  Hi/Lo  pass  filter." 

TYPE"  S....FILE  SPACE  must  be  available;  about" 

TYPE"  100  to  200  blocks.  " 

ACCEPT'TO  CONTINUE;  HIT  RETURN." 

REA»(U;2)  IHOLD 
2  FORNAX  SI) 

CALL  BYTEPACdCONTROL(l))  \  Clear  screen  and  hone  char 
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CALL  BYTEPAC(IC0HTR0L<2) )  ;  pointer  to  prevent  overwriting. 
CALL  FDELAY(IO) 

ACCEPT’  ENTER  HARE  OP  SPEECHFILE:  ’ 

READ< 11 / 1 0  >  PARAH ( 1 ) 

F0RHAT(S13) 

DO  1  J*1 ,7 

FILENAH(J)  «  PARAH(J) 

CONTINUE 


ACCEPT’ENTER  TITLE  FOR  SCRAN  (79  characters  or  less>:<15>" 
READ (11 ,11)T1TL£(1) 

F0RHAKS79) 

CALL  0PEN(5,FILENAN,1,512,1ER1>  ^channel  to  speechfile 
IF ( I ER1 .ME. 1 ) TYPE" ERROR  ON  OPEN,  1ER1-  ",IER1 


TYPE"N0TE:  You  oust  do  a  spectrograe  of  20  blocks" 
TYPE"  or  less  (SO  or  less  vectors),  so  that" 
TYPE”  it  nil)  fit  on  the  Tektronix  screen  !!’ 

TYPE"  This  equals  0.64  seconds  of  speech,  tax." 

TYPE" 

ACCEPT’ENTER  FIRST  SLOCK  TO  BE  REAS  :  ~  f I  START 

ACCEPT'ENTER  LAST  BLOCK  TO  BE  READ  :  “,ILAST 
TYPE'UHAT  TYPE  OF  HINDOO  IS  DESIRED’ 

ACCEPT’HANHINC  /  RECTANCULAR  (1  /  0):  ” ,IHAH 


ACCEPT-PREENPHASIZE  HIGH  FRiQ.?  (1*YES/  2=N0):  ”,IPRE 
IF ( IPRE . EQ • 1 ) ACCEPT "ENTER  DBs  per  OCTAVE  :  ",I0B 
IFdPRE.EO.  1 )  ACCEPT  "ENTER  STARTING  FREQ.  ;  "  ,FRE0 
ACCEPT’ENTER  DFT  SAMPLE  SIZE  (-  saepl es/vector )  :”,ISIZE 


ISIZEH-ISIZE/2  (number  of  distinct  freq. 


OPEN  19,“DC0HPS”,LEN=ISIZE  (store  freq  coeponents. 


ACCEPT 

"ENTER 

ENERGY  THRESHOLD  :  “ ,ENTHRES 

TYPE" 

fVDB  M 

(15)  " 

lire 

TYPE" 

wwwwkw  n  U  1  1  b  6  wwwwww  \ IJ / 

While  in  the  interactive  node,  EACH  tine’ 

TYPE" 

you 

listen  to  a  portion  of  a  spectrogram  the 

TYPE" 

following  information  it  written  to  a  file:(15>“ 

TYPE" 

1. 

Date  and  time* 

TYPE" 

2. 

Name  of  speechfile. 

TYPE” 

3. 

Spectrogram  heading# 

TYPE’ 

4. 

And  for  each  segment# 

TYPE" 

a. 

A  numbering# 

TYPE" 

b. 

Position  of  the  first 

o  o  o  r>  o  o  n  r»oc->orsc-»oi-> 


TYPI"  vector  heard  (in  oords)." 

TYPE"  Ci  Nunber  of  oords  heard •  " 

TYPI"  di  Segoeat  length  (sec)#  <13>" 

ACCEPT'TO  CONTINUE ,  HIT  RETURN" 

REAB(11,2>  IH0LI1 
CALL  BYTEPACdCOHTROLd) ) 

CALL  BYTEPAC(IC0NTR0L(2) ) 

CALL  PDELAY(IO) 

C 

C 

TYPE“(15>  (IS)  " 

TYPE"  WHILE  IN  THE  INTERACTIVE  NODE*  “ 

TYPE"  " 

TYPE"  Perfora  following  steps  on  Tektronix  teroinals" 
TYPE"  <15>  " 

TYPE"li  Position  crosshairs  at  FRONT  of  first  speech  “ 
TYPE"  vector  you  oant  to  hear#" 

TYPE"2i  Hit  ttice  any  key  (on  Tektronix)  to  send  cursor" 
TYPE"  position  to  the  cooputer." 

TYPE"3#  The  cursor  is  displayed  after  a  beep,  and  you  " 
TYPE"  should  noo  position  crosshairs  BEHIND  the  last  " 
TYPE"  vector  you  oant  to  hear." 

TYPE"4.  Next  hit  the  keys  that  represent  the  nueber" 
TYPE"  of  tines  you  oant  to  hear  the  section  of  speech" 
TYPE*  just  narked  off,  fron  1  thru  99  tines. 

TYPE"  For  exaaple,  entering  a  1  then  a  5  nil)  let" 

TYPE"  you  hear  the  segnent  15  tines.  (15)  " 

TYPE"«»«To  STOP  send  sane  crosshair  position  toice.  " 
TYPE"(15>  (15)  " 

ACCEPT"T0  CONTINUE,  HIT  RETURN.” 

READ( 11 ,2)  1H0LD2 

Print  a  heading  on  the  Tektronix  screen. 


«•«•««  For  too  character  conoands  oust  send  both  in 
one  oord,  so  use  BYTEPAC  to  pack  too  bytes  into 
one  neoory  oord.  Can  just  use  a  WRITE  BINARY 
for  one  character  (ASCII)  conoands. 

31  CALL  BYTEPACdCONTROL(l) )  {send  ESC  FF 

CALL  BYTEPAC( IC0NTRQL( 2) )  ;to  blank  screen 

CALL  DATE( IDATE,1ER3)  ;gtt  the  date. 

IF (1ER3.NE.1) TYPE "ERROR  ON  DATE,  IER3-  ",IER3 

CALL  F6TIHE(IH0UR,ININ,ISEC)  ;get  the  tine. 

oeeeee  To  output  heading  send  position  that  oil l  be  the 
loner  left  corner  of  first  character,  then  go  back  to 
the  Alpha  node  and  nrite  this  portion  of  the  heading. 

CALL  FDELAY(20)  {tine  delay  to  alloo 

{screen  to  clear. 

CALL  TEID0T(740,200)  {start  of  line. 
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WRITE  BINARY  (10)  ICONTROLU)  {to  qet  to  Alpha 
WRITE (10/ 100) TITLE! 1)  {send  tltlt 

FORMAT (S7T)  { 


CALL  TEXDOT(710,200)  {start  of  next  lino 

WRITE  BINARY  (10)  ICONTROLU)  {40  to  Alpha  oodo 
WRITE (10/1 10) FILENAR(l)  {send  filenaoe 

FORMAT! "THIS  IS  A  SPECTROGRAM  OF:  ",S13) 

CALL  TEKDOTU80/200)  {start  of  next  lino 

WRITE  BINARY  (10)  ICONTROLU)  ;qo  to  Alpha  node. 

WRITE  (10/ 112)  ISTART /HAST  {send  block  nuabers 

FORMAT ("FIRST  0LQCI=  “ ,I2,5X,"LAST  BLOCI*  “ ,12) 

CALL  TEKD0TU50,200>  {start  of  next  lino 

WRITE  BINARY  (10)  ICONTROLU)  ;qo  to  Alpha  aodo. 

WRITE ( 10 , 120 ) IDATE  {send  date 

FORHATC'DATE:  ”,213,15)  { 

CALL  TEXD0T(650,478)  ;start  of  next  output* 

WRITE  BINARY  (10)  ICONTROLU)  ;*o  to  Alpha  node. 
WRITE( 10 ,125) IHOUR/IHIH ,ISEC  {send  tiao 
FORHAT("TIHE:  ”,313)  { 

CALL  TEKD0T(620,200)  {start  of  next  line. 

WRITE  BINARY  (10)  ICONTROLU)  \qo  to  Alpha  node. 
WRITE(10,130)IDB  {send  preeaphasis  info 

FORMAT (“FREQ*  PREEHPHASIS,  DB/OCTAVE:  “,I3) 

CALL  TEKB0T(590/200)  {start  of  next  line* 

WRITE  BINARY  (10)  ICONTROLU)  \qo  to  Alpha  node. 
WRITE(10,140)FREQ  {start  of  preeaphasis* 

FORMAT! "PREEHPHASIS  STARTS  AT,  FREQ. :  “,F5.0) 

CALL  TEXD0T(540,200)  {start  of  next  line* 

WRITE  BINARY  (10)  ICONTROLU)  {?o  to  Alpha  node. 
WRITE! 10,150)ENTHRES  {send  threshold  value. 

FORMAT! "RHS  ENERGY  THRESHOLD:  ”,F6.1) 

CALL  TEXD0T(530,200)  {start  of  next  line* 

WRITE  BINARY  (10)  ICONTROLU)  {qo  to  Alpha  node. 
WRITE! 10 ,160) ISIZE  {send  DFT  size* 

FORMAT ("DFT  SAMPLE  SIZE:  ”,13) 


CALL  TEXDOT  (500,200)  {start  next  line. 

WRITE  BINARY  (10)  ICONTROLU)  {Alpha 

1F(IHAH.EQ.1)WRITE( 10,141)  {Haaainq 

IF(IHAM.NE. DWRITE  (10, U2)  {Rectangular. 

FORMAT! "HAMMING  WINDOW  USED") 

FORMAT! "RBCTANGULAR  WINDOW  USED") 


I 


r>r»or>r»or>  oor> 


C 

C 
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165 


Hark  off  the  tioc  axis;  in  vectors. 

SO  165  JJ«l,5 
J*JJ-1 

INUH’dSTART  ♦  Jt5)»4  (vector  narking. 

1X=9  ♦  J»240  lurking  position. 

CALL  TEXD0T(40;IX)  (Start  line. 

WRITE  BINARY  (10)  ICONTROLU)  (to  Alpha  node. 
URITE(10;164)INUH  (send  narking. 

F0RNAKI3)  ( 

CONTINUE 


Write  out  an  axis  label. 


166 


CALL  TEKD0T(10;419) 

WRITE  BINARY  (10)  1C0NTR0L(6) 
WRITE! 10;166)  (tend 
FORMAT ("VECTOR  NUMBER") 


(start  line, 
(to  Alpha  node. 
1  abel . 
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Print  an  axis  for  the  spectrograa  on  the  screen 
of  the  Tektronix,  then  delineate  the  X-axis  every 
vector  and  delineate  the  Y-axis  every  frequency 
component.  The  subroutine  TEKLIHE 4 1 Y , IX , I Y 1 , 1X1 ) 

C  draas  a  line  froe  the  point  < IX ; I Y )  to  (IX1;IY1). 

C 


CALL  TEKLIHE( 474,991 ,69; 991) 

;Y-axis,  right. 

CALL  TEKL1NE( 474,29 ,69,29) 

(Y-axis,  left 

p 

CALL  TEkL I HE (69, 29, 69, 990) 

(X-axis  done 

u 

00  170  1X0*30,990,12 

(delineate  the 

CALL  TEXLINEt 66,1X0,76,1X0) 

(X-axis  first. 

p 

I/O 

CONTINUE 

l 

DO  175  1X0*30,990,120 

(put  extra  aark 

CALL  TEKLINE(76, 1X0,70, 1X0) 

(each  10  vectors. 

* 

r 

175 

CONTINUE 

DO  180  IYD*90 ,474,12 

(delineate  the 

CALL  TEXL1NE(IYD,26,1YD,16) 

(Y-axis,  left. 

p 

160 

CONTINUE 

l 

DO  165  IY0*90,474,96 

(put  extra  nark 

CALL  TEEL 1NE( I YD, 16, I YD, 6) 

(every  lXHz. 

r 

165 

CONTINUE 

l 

00  190  IYD*90,474,12 

(delineate  the 

CALL  TEXLlNEdYO, 992, 1Y0, 1002) 

(Y-axis, 

right. 

p 

190 

CONTINUE 

00  195  IY0*90,474,96 

(put  extra  eark 

CALL  TEKLINEdYO, 1002, IY0, 1012) 

(every  1 

XHz. 

195  CONTINUE 
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Nov  create  the  spectrograa  in  the  steps  outlined: 

1.  Clear  array  that  is  used  to  add  up  energy 

in  a  sentence • 

Hain  loop  of  program 

2.  Read  a  block  of  speech* 

3*  Set  B(J)  *  Magnitude  of  speech  components. 

Then  nindoe  as  requested! 

4*  Then  set  DTARAY  *  B(J),  so  array  is  complex. 

5*  Cal)  DPT  routine  to  get  freq.  info. 

Suh-loop  of  prograa  (once  for  each  freq) 

a.  Get  eagnitude  of  freq.  conponents. 

If  Haaainq  eindoe  used  noraalize  the 
■agnitude  of  the  freq.  components 
•rt  a  rectangular  eindoe  ( = 1 > . 

b.  Preeaphasize  frequencies  above  500  Hz. 

c.  Save  energy  coaponents  in  IENERGY. 

d.  Energy  noraalize  the  vector. 

End  of  sub-loop 

6.  Threshold  vector  ert.  energy. 

7.  Sua  total  energy  I  each  freq  in  TENERGY. 

8.  Change  B(J)'s  to  integers. 

8.  Store  freq  coaponents  of  each  vector 
in  DCOHPS. 

9.  Scale  freq  coaponents  froa  1  to  10. 

10.  Output  to  screen  one  vector  of  info. 

11.  Change  vector  number  and  GO  TO  «1. 

12.  When  done  save  TENERGY  and  RING  BELL. 


No*  do  it  to  it. 


200 

C 

C 


C 

300 


DO  200  1 1  - 1  ^  I  SI  ZEH  (dear  file  used  to 

REN£RGY(I1)*0.0  ;sua  up  energy. 


DO  900  1 2« ISTART  a  HAST  (begin  tain  loop 

COLL  RDBLK(5,I2,IDSP,1,IER)  (input  speech  info. 
IF(IER.HE.l)TYPEnERR0R  ON  RDBLI  *  ",IER 
J 1 3  0  (Keeps  track  of  ehether  to  read 

(another  block  of  speech. 

WND0*1.0  (Default  to  rectang.  eindoe. 

00  310  J*1,ISIZE  (fora  complex  array 

B(J)’IDSP(J*J1)  ;Jl  3  block  nuaber. 
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c 


IF  (IHAH.EO.l)  UNDO  *  0.54  -  (0.46  «  COS(  2. 
/  3.1415926536  »  (J  •  D/1S1ZE)) 

C 


C 

c 


c 


c 


c 

325 

C 


C 


432 


C 


C 


433 


C 


434 


B(J)  *  UNDO  •  B( J) 


DT AR AY  <J)=B(J) 
CONTINUE 


{Bust  call  DFT  «ith 
{coaplex  array. 


L ARAY « I  SI ZE  {size  for  DFT  call. 

INVER*0  {indicate  DFT  direction. 

CALL  DFT5(DTARAT ,LARAY , INVER)  {get  DFT  for  this  vector 


DO  325  I3*1*ISIZEH  {get  freg  conponent  aagn. 

B(I3) *CABS(DTARAY(I3)/LARAY)  { 


IF  (IHAN.EQ.l)  B(  13)  «  B(I3)  /  0.54  {nornalize 

{to  equivalent  eagnitude  as  a 
{rectangular  eindoe. 

CONTINUE 


IFdPRE.NE.DEO  TO  400  {to  not  preeephasize 

IFREQ1  =  IFIXl F REQ/SFREQ* I S I ZE )  + 1  {first  freq  preeeph. 


DO  400  IN«IFREQ1;ISIZEH  {preeeph.  freq  coeponents. 

RI=FLOAT(IN)  {change  integer  variables 

RIFRE01 *FL0AT( IFREQ1 )  {into  real  variables. 

RIDB*FLOAT(IDB)  { 

{Perfore  freq  preeaphasis  next. 

B(IN) :B( IN)t(10.**(RIDB/20*AL0G10(RI/RIFREQl )/AL0G10(2.0) ) ) 
CONTINUE 


SUHE:0«0 

DO  432  Js 1 f IS1ZEH 
SUHE=SUNE+(B(J)««2) 
CONTINUE 

ENERGY-SQRT(SUNE) 


{energy  nornalize  vectors. 
{suassquareroot  of  the 

{sun  of  the  squares. 


ISAHPPdSANP+l 

URITE ( 4 )  ISAHPP, ENERGY 


{vector  counter, 
{store  energy. 


IF ( ENERGY. GT.ENTHRES) GO  TO  433  {if  beloa  threshold 

{increase  energy  to  attenuate  coeponents. 

ENERGY*5.«ENTHRES  { 

CONTINUE 

IHULT*32767  {aax  possible  energy  value. 

DO  434  JdflSIZEH  {change  to  integers. 

B( J)*<B( J)/ENERCY)*IHULT  {nornalize. 


RENER6Y(J)>RENER6Y(J)«B(J)  {sua  energy. 

CONTINUE 
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DO  500  JJ*1 /ISIZEH 
IBI ( JJ) *IFIX(B( JJ) ) 
CONTINUE 


{array  valuta  convtrttd 

{to  integer  fortat. 


{save  vector  components  for  use  later 
CALL  WRITRW(19,ISAHP,IBI,1,IER4) 

IF< IER4.EQ.1.0R.IER4.EQ.312DC0  TO  510  { 

TYPE"  MRITRW  ERROR t  1ER4*  ",IER4  { 

DO  545  IN= 1  / ISIZEH  {scale  components 

IBI ( IN ) 5 IF1X( FLOAT (IBH IN) 1/32767 *10) +1 
IF(IBI(IN).LE.0)IBI(IN)*1  {froe  1  to  10. 

IF(  IBI  ( IN)  >GT<  10)  IBK  IN)  :10 
CONTINUE 


Output  the  coeponents  of  this  vector  not. 

DO  700  IN*lf ISIZEH  {for  each  freq.  component 

CALL  TEXTONE! ISAHP;1N;IBI< IN) )  ;dra«  pixel 
CONTINUE 


I3AHP*ISANP*1  {go  to  next  vector 

J13J1+1SIZE  {can  another  vector  be  foreed 

IF(J1.LT»256)C0  TO  300  {froe  this  block. 


CONTINUE 

End  of  tain  proqraa  loop. 

DO  910  IN>1, ISIZEH 

WRITE! 1 )  I /RENERCY ( IN) 


TYPE"  (7)  <7>  <7>  <7>  “  {sake  up  operator 

{for  interactive  node. 

Noe  qo  into  interactive  eode  on  the  Tektronix 
by  Baking  use  of  the  Gin  Bade  of  the  tektronix. 

This  section  of  the  proqraa  oil)  do  the  follonings 

a.  Send  ESC-SUB  to  put  the  Tektronix 
into  Gin  node  (oust  use  the  Bytepac 
routine  so  both  bytes  are  sent  in  a 
single  lord). 

b.  Tell  user  that  to  do  noo. 

c.  Decode  first  group  of  bytes  sent 
froe  the  Tektronix  into  X,Y  tekpoint 
position  of  starting  position. 

d.  Decode  second  group  of  bytes  tent 
into  the  nuaber  of  tiaes  you  eant 
to  hear  portion  of  spectrograo, 

and  the  final  X,Y  tekpoint  position. 

e.  Check  to  be  sure  that  the  second 


{save  total  energy 


C»**»»* 

C 

C 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 
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position  it  not  equal  to  the  first 
position  fohicb  occurs  ohen  user 
■ants  to  leave  tht  progran). 
f.  Transfer  control  to  tht  progran 

■hich  is  used  to  1/0  speech ,  this  is 
prograa  TALI.SV  ahich  is  used  to 
output  user  specified  seqaents 
of  a  speechfile.  This  is  done  utilizing 
the  eclipse's  D/A/D  converter* 
q«  Continue  eith  the  use  of  the 
cursor  routine* 

NOTE*  The  following  describes  the  bytes  sent  out 
fron  the  TeKtronix  terninal  then  too  Keys 
are  hit  ehile  in  the  Cin  node* 


♦ . ♦ . ♦ - ♦ . ♦ - ♦ . ♦ 

byte  1  byte  2  byte  3  byte  4  byte  5  byte  4 

♦ . ♦ . ♦ . ♦ . ♦ . + . 

first  byte  sent  last  byte  sent 

byte  1 . contains  the  ASCII  code  first  Key  hit 

byte  2 . contains  TeKtronix  coded  Hi  X  5-bit  byte 

byte  3 . contains  TeKtronix  coded  Lo  X  5-bit  byte 

byte  4. . .... .contains  TeKtronix  coded  Hi  Y  5-bit  byte 

byte  5 . contains  TeKtronix  coded  Lo  Y  5-bit  byte 

byte  4*. . contains  the  ASCII  code  last  Key  hit 


CALL  RESET 

Write  heading  to  file  ISTORE • 

CALL  F0PEN!3,""IST0RE"')  ? 

C 

CALL  FGTI  HE  (I  HOUR;  IN  IN;  I  SEC)  •, Current  tiae* 

C 

WRITE<3,917)  \ 

917  FORMAT!  12X, "'DOCUMENTATION  OF  WORK  FRON”)  ? 

WRITE ( 3 ;91 S )  ; 

918  FORMAT!  19X,"'T  E  K  T  A  L  X  ")  ; 

WRITE<3 ;920) IDATE  ; 

920  FORMAT! 10X;“DATE  s  ",213,151  ? 

WRITE  (3, 92  DIHOUR,  IM  IN,  I  SEC  ; 

921  F0RHAT(10X,"TIHE  :  ",3131  ; 

WRITE ( 3,922 ) FILENAM! 1)  ; 

922  FORMAT! 10X, "NAME  OF  SPEECHFILE  i  ”,S13)  { 

WRITE! 3,923 )T1TLE(1)  \ 

923  FORMAT!"  ",S79)  \ 

I  COUNT  *  1  ^counter  for  nueber  of  segeents. 

C 


oou  o  o  o  o  o  UUOUOUUOUUCJ 


IT*0 

990  CALL  RESET 

CALL  0PBM(3,"1STQRE“,0,IERA)  {docuaentation  file. 
IF ( 1ERA .ME • 1 ) TYPE” ERROR  0M  OPEN,  IERA:  “,IERA 


1000  CALL  BYTEPAC< ICOMTROL(l) )  {send  ESC-SUB  for 

CALL  BYTEPAC(IC0NTR0L(3))  {for  Gin  aode. 

READ  BINARY  (11)  1(1)  ,1(2)  ,1(3) 


IF(IT.E8.0)G0  TO  1010 
IHEAR1 ZIAND(INASX(2) ,1(1)) 
IHEAR1=ISHFT(I(1) ,-B) 


{first  position 
{»  of  tiaes  you 
{hear  portion. 


IHEAR2:IAND(INASI(1) ,1(3)) 


{2nd  Key  hit 


1010  IH(1):IAND(IRASK(1) ,1(1) ) 
I H ( 1 ) ZISHFT(IH( 1 ) ,5) 
IL(1)=IAND(INASK(2) ,1(2) ) 
IL(  1)  =  ISHFT( IL( 1 ) , - 8 ) 


;decode  the 
{X  position 
;of  cursor 


IH(2)-IAND(INASX(1) ,1(2))  {decode  the 
IH ( 2) = ISHFT ( 1H( 2) ,5)  {Y  position 
1L(2)-IAND(1HASK(2> ,1(3) )  {of  cursor. 
IL(2)ZISHFT(I(3) ,-B)  { 


The  first  and  last  bytes  froa  the  TeKtronix  carry 
only  the  ASCII  code  for  the  first  and  second  Keys  hit 
at  the  TeKtronix  terainal.  Hitting  the  first  Key  causes 
cursor  position  inforaation  to  be  sent  and  the  second 
Key  is  hit  to  fill  up  the  last  eord  (16  bit)  of  infor¬ 
aation  sent  to  the  coaputer.  These  tao  bytes  are 
exaained  and  decoded  upon  receipt  of  the  final  cursor 
position,  enabling  the  user  to  specify  the  nuaber  of 
tiaes  a  section  of  speech  eil)  be  heard. 


C 


1050 


C 

C 

C 


C 

1100 


IF(IT.EQ.1)G0  TO  1100 


WRITE  BINARY  (10)  IC0HTR0L(7) 

CALL  BYTEPACdCONTROL(l)) 

CALL  BYTEPAC( IC0NTR0LQ) ) 

GO  TO  1000 


{decode  position. 


{bel  to  reset. 

{send  ESC-SUB 
{in  one  eord 
{so  bacK  in  Gin.C 
{final  position. 


DO  1050  IN:1 ,2 

IU0RD1  (IN) z  I0RdH(  IN) ,  IL(IN) ) 
CONTINUE 
IT*1 


I T » 0  {on  receipt  of  both  positions. 
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jforn  position 


SO  1110  IH«1,2 

IV0RS2dH)*I0RdHdH>,IL<IH>) 

jin  tcKpointt  for  non* 

1110  CONTINUE 
C 

I  STOP* IVORS 1 ( 1 > - IU0RS2 ( 1 >  (check  for  «top 

IFdST0P.E9.01G0  TO  2001  { 

»****«  Calculate  crosshair  position  (X  only)  in  terns  of 
words/  nhere  256  words  *  1  block  *  4  vectors  *  32  nsec* 

ISWORD* ( I  START >256)  ♦  ( I U0RS1 ( 1 ) -30) *5  ♦ 

/  IFIX  ( FLOAT  (IWORDld)  -  301/3)  j 

INWORD* ( I W0RD2( 1 ) -IVORS 1 ( 1 ) )»5  4 
/  I F 1 X ( FLOAT ( 1U0RD2 ( 1 1  - 1 WORD 1 1 1 ) )/3)  ; 

Continue  filling  file  ISTQRE  eith  docunentation. 

VRITE(3,1299)  ; 

1299  F0RHATU1X," . “) 

WkITE<3,1300)  ICOUNT  \ 

1300  FORHATllOX, “SPEECH  SEGMENT  NUMBER:  "/I6> 

WK1TE(3;1305)  ISWORD  ; 

1305  FORMAT! 10X /"FIRST  WORD  HEARS  WAS:  ",16) 

WRITE(3f 1310)  INWORD  ; 

1310  FORHAT ( 10X /“NUMBER  OF  WORDS  HEARS:  “,Ii) 

RLEHCTH*  FLOAT UHVORS  /  0000.)  ; length  sec. 

URITBt  3 / 1315 )  RLEHCTH 
1315  FORHAT ( 1 OX , "SEC RENT  LENGTH  (SEC):  ’,F7.4) 

»**»»»  Calculate  nunber  of  tines  segnent  nil) 
be  heard  via  the  progran  TALI.SV. 

IHEARD*  10«(IAND(INASX(3) ,  I  HEARD  )  (ten's  position. 

IHEARS*  1HEARS  +  < I AND( IMASK 3) ,IHEAR2) )  -.entire  ». 


This  is  where  control  is  transferred  over  to 
progran  TALX.SV/  in  order  to  hear  the  speech.  The 
transfer  is  accomplished  using  a  "CALL  SWAP”. 

PARAH(8)  *  ISWORD  (These  are  the  para- 

PARAH(9)  *  IHWORS  jneters  used  by 

PARAH(IO)  *  IHEARD  (TALK.SV  for  outputting 

{speech. 

CALL  CFILW(”PARAH.AD”,2,IER5) 

IFCIER5  .HE.  1)  TYPE’CFILW  ERROR  *  ”,IER5 
CALL  0PEH(2/”PARAM.AD”/3/lER6) 

IFdERO  .HE.  1)  TtPE"0PEN  ERROR  *  ",IER6 
CALL  WRSEQ(2/PARAM/20/IER7) 

IFdER7  .HE.  1)  TYPE-VRSEQ  ERROR  *  ",IER7 
CALL  CL0SE(2/IER8) 

IFdERO  .HE.  1)  TYPE"CLOSE  ERROR  •  “,IER8 
CALL  SWAP! "TALI.SV" / IER9 ) 
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IFdER?  .MB.  1)  TYPE’SMAP  ERROR  *  “,IER9 

On  rtturn  fron  the  speech  output  subroutine; 
continue  in  the  interactive  loop  end  increment  counter. 

ICOUNT  >  1C0UMT  ♦  1 
GO  TO  m 
2001  CONTINUE 

««••••  On  receipt  of  interrupt  coooand  fora  Tektronix; 
notify  user  that  proqraa  is  aaiting  for  input;  then 
continue  as  directed. 

00  2010  IUAIT-1  ;4 

ACCEPT-  <7>  <7>  <7>  ” 

CALL  FDELAY (20) 

2010  CONTINUE 

TYPE"  < 15)  < 1 5 >  " 

CALL  BYTEPACdCONTROLd ) ) 

CALL  BYTEPACf ICOMTROL ( 2 >  > 

CALL  FDELAY(20) 

In  case  user  accidently  sent  the  saae  position  teice 
this  portion  of  the  proqraa  ail  I  alloa  continuation  of 
eork. 

TYPE"  INTERRUPT  CONNANO  RECEIVED  FRON  TEKTRONIX  (15)  " 

TYPE"  Enter  0  :  to  STOP." 

ACCEPT"  Enter  1  :  to  CONTINUE  IN  SANE  NODE  <15>U;IFLAC 

IF( 1FLAG . EQ .0) GO  TO  2020 

ISANP»0 

IBYTE*W 

CALL  BYTEPAC(IBYTE) 

CALL  RESET 

CALL  0PEN(5;FILEMAH;1;512;1ER1> 

OPEN  19,"DC0NPS“,LEN*1SIZE 

IF1IER1  .HE.  1)  TYPE" EROR  ON  OPEN;  IER1>  ”  ,IER1 
OPEN  1 ;“TENERGY" 

OPEN  4;"IENERGY" 

GO  TO  31 

2020  CALL  RESET 
STOP 
END 


{get  the 

{attention  of 
;the  user. 
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£****#» *»**##*****###*»*»******«****»***«*#**#*»»*»*** **♦*•♦*******»*»***»»*» 

C  lities  Ta i K 

C  futnc's  Lt  Aden 

C  Tate:  Dec  82 

C  Conpue  ecu:;,  vo. : 

FORTRAN  XAU 

C  loac  ccTsaT-.aj 

RL3R/>'  2  .<  TALK  3A-C0NrI3-  JSA'LIBS 

2  F  u  r. :  t :  c " : 

1  ’iis  p:s,s  sacs  ;e:tjor:  c*  1  ires: ' 

;  -  ~  *  r  s  J*  ;  ipH  C  /  V  •'  £  ®  .  1 Z  §  »  I  I  *  0  a  C  5  4  •'  C  4  i 

C  :  o  *■' ;  a  v?  3i  e  t  d  tf  ^  i  “  :u  "» *:  *?!**,  *  j  a  3 » m  0 v  *  * :  .  ^ :  3  r 

c/  ;;»*5  to  -eject  j'?>jacx.  T r  - s  t  -c;***  •  .-*.*•  as ’  t: 

2  2?  jsea.  0j»  -Jy-i.r;  tie  •:  aer  -'vst  >r:te  :•  ;;  -  ?.  -i  s  ■:  a  >*  s 

I  to  *ie  a-r.  men  »«*::•:«$  ip  t*»*  :  ‘c$r s*.  T  e  ;::e  .  »ec  to 

■;  a*;ve  the  Eclipse  A  S/A  cev.ee  :i  »r*-M  <  *  *  ?*c  s-  ou  ?. 

C  r:t  be  a  terse,  ".‘it' out  >■:  et  ;s  of  *o*  t*e  :e-.:s  iterate* 


:  1  .  5  -  'v  A*.  - 

. 


'  ^  ih»  Q  y 


;  _  p  r 
’  :  i  r  j 


:data3!.  :5i  *2  ■ 


I fv  ? 2 3 s«  1 3 F  Sr  i*  .  5  •  -  * 

1  ’i  i  2  3  c  r  r  A  3  A  1  <  1 0  ,  >  ? .  S  t>  „  X  « S  Li '  t  L  i  , :  i. .  • 

:r.i  - qfes  : 

CALL  *  C  2  i .!  i  1  . J  A  k  A  A  » £  0  « a  c  a  : 

*  ?  1  I E  R  •  ^  -  *  1  -  ^  T  “  -  ’  r\  J  '  3  *  e  $*,■*••*  ,  _  £  R 

-  -  :-i  v::  ■ 

r  I  :’BL;  :  1 1 1  .  i  r  A?' A'  •  :  '  -  1  i-  ‘ 
i  -  - A "  *  1  .  L £  .  2 5 6  »<«;.*•  i 

IF  i  ?  JRAIVV  ■  . &T  .5:  -  /  '-.;’BLF  =  >T  »  •!* 

iKI*  -?43A,1<  3  » -<  !  ?:R?LK*236  / 


:  ALL 

0:’EN  ( 1 

A 

pat,: 

>  I  EE  ; 

If  { ] 

:fr.ns. 

i ) 

:  t  ?  s 

“ 3  p t  h  error  , 

IEs," 

with. 

w  AL&* 

FiDBLK: 

* »? 

*  K  B  ^  K 

,ItATA3,NjrjLK 

,:E') 

r  n 

i.r  i , 

iES.Nc. 

1) 

T  if  PE 

“K’OBLX  error 

,;ek,' 

'  eitfi 

CALL 

CLOSE ( 

i  »I 

2P) 

IF  (. 

I  Eft . HE . 

1> 

XYr  E 

“Close  error  “ 

’  with 

IDATA1=*4000K  ;  <-* 

IDATA2  =  r-AKAH(*)  ;  <-- 
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CALL  DSTRTilER)  ; 

IF  (IER.nE.1)  CALL  ERRGAl “DSTST  error"  » 
1  =  1 

10  ILATA3(I)  =  :tATA3(J') 

:  =  iti 
c  •  c  ■*  * 

I ?  1 1 . 1 E « *■'  A  •< 1  r  i )  i  SO  ?'j  10 


CALL  * ' I T  J  > I . i : A  > 1 2 3 £ 2  , o  .  i » -  ’ A l  ,I3A?S2  , 
:?  CER.NE.l  TY-E  “i:r«  e-r:?  '  ,  IEA 


y  -  k + i 
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APPENDIX  B 

PHONEME  AMPLITUDE 
PLOTS 
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B.  Phoneme  Amplitude  Plots 

The  following  plots  are  a  representative  cross  section 
of  the  total  population  of  phonemes  that  showed  a  degradation 
in  performance.  A  sample  is  only  used  because  with  29  phonemes 
showing  a  degraded  performance,  this  could  account  for  ap¬ 
proximately  145  plots. 
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1  2 PHONEME  AMFL 


PHONEME  flMPL 


g PHONEME  flttPL. 


?  PHONEME  ftttPL 


PHONEME  AMPL. 


AMPLITUDES  FOR  PHONEME  #  29  FROM  FILE  FSP4A3U.01 
WORDS  SPOKEN  AND  SPEAKER'  8-3  BY  SEELA NOT 


PHONEME  AMPL 


1  :?  PHONEME  AMPL . 


i  ? PHONEME  PNPL . 


AMPLITUDES  FOR  PHONEME  »  29  FROM  FILE  4ATRY1V.01 
WORDS  SPOKEN  AND  SPEAKER'  8-3  BY  SEELANOT 


PHONEME  AMPL 


PHONEME  AMFl . 


AMPLITUDES  FOR  PHONEME  #  25  FROM  FILE  3 
WORDS  SPOKEN  ANO  SPEAKER *  8-3  BY  SEELANOT 


? PHONEME  AMPL 


AMPLITUDES  FOR  PHONEME  #  25  FROM  FILE  4ATRY2F.01 
ROS  SPOKEN  WO  SPEAKER'  8-3  BY  SEELANDT 


PHONEME  AMPL 


PHONEME  HMPL. 


PHONEME  AMPL 


■p  PHONEME  AMPL. 


g PHONEME  AMFL 


AMPLITUDES  FOR  PHONEME  #  20  FROM  FILE  1BTRY2F.0X 
ROS  SPOKEN  ANO  SPEAKER’  0-1 -2-3  BY  SEELANOT 


o  PHONEME  AMPL. 
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APPENDIX  C 


PLOTTING  THE 
SPECTRAL  COMPONENTS 
OF  A  SPEECHFILE 
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PHONEME  AMPL 


IdWU  3W3N0Hd 


AMPLITUDES  FOR  PHONEME  #  02  FROM  FILE  1BTRY2F.01 
RDS  SPOKEN  AND  SPEAKER*  0-1 -2-3  BY  SEELANDT 


t  ? PHONEME  AMPL. 


AMPLITUDES  FOR  PHONEME  »  02  PROM  PILE  3UFSP1B.01 
RDS  SPOKEN  ANO  SPEAKER'  0-1 -2-3  BY  SEELANOT 


o PHONEME  hMPL. 


AMPLITUDES  FOR  PHONEME  «  02  FROM  FILE  1 
WORDS  SPOKEN  AND  SPEAKER’  0-1 -2-3  BY  SEELi 


PHONEME  HMPL 


i  2  PHONEME  AMPL . 


AMPLITUDES  FOR  PHONEME  #  03  FROM  FILE  1 
WORDS  SPOKEN  ANO  SPEAKER*  0-1 -2-3  BY  SEEL 


t  ;■>  PHONEME 


AMPLITUDES  FOR  PHONEME  #  03  FROM  FILE  1BTRY5U 
OROS  SPOKEN  AND  SPEAKER*  0-1 -2-3  BY  SEELANOT 


* PHONEME  AMPL. 


AMPLITUDES  POR  PHONEME  #11  PROM  PILE  1BTRYEP.01 
WORDS  SPOKEN  ANO  SPEAKER*  0-1 -2-3  BY  SEELANOT 


PHONEME  AMPL 


C.  Plotting  the  Spectral  Components  of  a  Speechfile 

The  program  RECORD  was  used  to  plot  the  spectral  com¬ 
ponents  of  any  given  vector  of  a  speechfile.  This  was  used 
to  compare  the  results  of  a  64  point  DFT  with  a  128  point 
DPT,  both  performed  on  the  same  speechfile.  Figure  C.l  and  C.2 
illustrate  the  resulting  plots  for  a  128  point  DFT.  Figure 
C. 3  is  the  corresponding  plots  for  a  64  point  DFT.  As  can 
be  seen,  the  64  point  DFT  has  a  "smoothing"  action  associated 


with  it. 


PROGRAM  RECORD 


File: 

Language : 

Date: 

Author : 

Subject: 

Calling  Sequence: 


RECORD 
Fortran  5 
November  15,  1982 
J.  Fletcher 

Plot  of  Spectral  components  of  a 
Speechfile 

RECORD 


ARGUMENT 

TYPE 

ICOMP 

Integer 

IRCC 

Integer 

ISTART 

Integer 

IFILE 

string 

COMPNT 

Integer  Array 

PURPOSE : 

PURPOSE 

number  of  frequency  components 
sets  record  size 
first  component  read  from  file 
name  of  spectrum  file 
array  for  components 


This  program  is  used  to  plot  the  spectral  components 
of  a  speechfile. 

DESCRIPTION: 

Location:  FLETCHER.  DR 
ARGUMENT  STRUCTURE: 

ICOM?  =  number  of  frequency  components  per  vector 

in  integer  form.  It  is  of  the  power  of 
two  (i.e.  32,64  or  128). 

ISTART  =  integer  number  of  vector  desired  to  be 

used.  This  must  be  a  three  digit  number 

IFILE  =*  The  name  of  the  spectrum  file  to  be  used. 

It  is  in  a  7  character  Hollerith  string 
format. 


PROGRAM  USE: 


This  program  uses  a  subroutine  GRPH2  to  plot  the  spectral 
components  of  a  vector  on  the  Tektronix  4010  terminal.  The 
vector  and  spectrum  file  is  selected  by  the  user.  The  user 
must  also  instruct  the  program  as  to  how  many  spectral  com¬ 
ponents  there  are  per  vector.  GRPH.LB  must  be  loaded  with 
this  program. 
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NOTE: 


THIS  PROGRAM  IS  FOR  FORTRAN  S  AND  MUST  BE  RUN  ON  THE 
TEKTRONIX  4010  TERMINAL* 


This  program  uses  a  spcctrua  f &)•  created  by  the  program  SGRAH 
to  plot  the  spectra)  components  for  a  tiae  slice  on  the  TeKtronix 
4010  terminal*  The  user  has  the  option  of  choosing  his  spectrum 
file,  the  number  of  frequency  components  per  time  slice  (limited 
to  a  maximum  of  12S  spectral  components),  and  the  time  slice  he 
mishes  to  observe* 


DIMENSION  IBK12S),  C0HPNT(12A>,  IFILE(13) 

DO  10  1*1,120 
1BI ( I )  *  0 
COHPNT(I)  *  0 
CONTINUE 

ACCEPT"HOM  MANY  FREQ.  COMPONENTS  (POHER  OF  2>?”,IC0HP 

IREC  *  I COUP  »  2  5  Sets  record  size  fhen  opening  spectrum  file. 

ACCEPrVECTOR  TO  BE  PLOTTED  (ENTER  THREE  DIGITS):  " 

READ (11,20)  ISTART  ;  Time  slice  to  be  observed. 

F0RHATU3) 

ACCEPT "COMPONENTS  FROM  SPEECHFILE:" 

READ (11 ,30 )  IFILE(l)  {Name  of  speechfile  spectrum  came  from* 
FORMAT (S13) 

CALL  OPEN (2,IFILE,1,IREC , IER1 )  ;  Open  spectrum  file 

CALL  CHECK(lERl) 

CALL  READR(2, ISTART, IBI, 1 ,IER2)  5  Read  one  record  from  file. 
CALL  CHECK IER2) 

DO  40  1*1,120  ;  Create  a  real  array  of  components 

COHPNT(l)  *  FLOAT(IBKI) )  ;  Maximum  size  is  120  components* 

CONTINUE 

ACCEPT"PRINT  COMPONENTS  ON  PRINTER?  (1*Y/0*N)",IPRNT 
IFdPRNT  *EQ.  0)  GO  TO  00 
WRITE (12 ,50 >  (CONPNT(I) ,  1*1,120) 
F0RHAT(5X,F10.2,5X,F10.2,3X,F10.2,5X,F10.2) 


GRPH  2  prints  a  plot  of  values  that  are  in  COMPNT  array* 


CALL  CRPH2TVECT0R  FREQ.  COMPONENTS",!, COMPNT, U,IC0HP,0,XMIN,XNAX,0) 
WR1TE( 10,70)  ISTART,  IFILB(l)  \  Header  information  for  graph 
FORMAT<"PLOT  FOR  VECTOR  ",I3,"  FROM  FILE  ",813) 

RIAB(ll,00)  IPAU8 
FORMAT (SI) 


*  • 
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CALL  RESET 
CALL  ERS(l) 

CALL  TDELAY(2) 

ACCEPT’RETURN  ?  <1*Y/0*N)  MANS  \  Return  to  look  at  another  tiee  slice 

IF ( IANS  .EQ.  1)  CO  TO  5 

STOP 
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